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pSPRH830b E.co/f-Micromonospora shuttle 




pSPRH830b - pSPRH826b backbone 



Function 

- Ampicillin resistance 

- Multiple cloning site 

- pUC18 origin 

- Hygromycin resistance 

- oriT (origin of transfer) 

- pIJ702 origin of replication 



Source 
(pUC18) 
(pUC18) 
(pUC18) 
(pl6Rl) 
(pRL1058) 
(pU702) 



pSPRH840 integrating vector 

ure 7A 




pSPRH840 - pSPRH826b backbone, pMLPl xis, int attP insert 
pSPRH840 conjugated 

from E. coli into HmR transformants obtained 

M.carbonacea + 

M.rosaria 

M.halophitica + 



1 CTACCTCXXXTTKSATCCTreXX^ 

tn GGccexrKxrraayxTGCXSOG^ 

233 (STOXOCOCOG^OSCX^Gf^^ 

465 TC^TGCGGCGGCCTCCyACAGGTCCTC^^ 

813 ACACCTATGAAQGCyAGCAAGATGCrCG^ 
929 GGTCCXMCCTQOSCm'ACGAC^ 

1» M RNTPGLGRGTWAAYVLTARERAGLTk 

1044 AGCGACTTGGCCAGGCGCATCC^ 

28* SELARRt QKDRATVGRWEDGKNRPDOADLVARVAQVLGL 

MOU OEEI ELVRT 

1276 ACGAGGACATGAAGCGGCGCATC^TCG^ 
105* D EDMKRR1 IALI LERREROKAAAf EETKRLt DLFRRS • 

1> V W I EKNGPVYR1 RDLVRGKKVT1 QTGYPTKTSAKNAM 
1506 TGCAGTTCCGTCXX3GAGCAGTTCC 

38^ VGFRA EQL QGNA LMPRGGQI TLA OFVGEWWPSYEKTL KP 

1622 ACC^XXGTGAACTCGGAGGGCAACCGCATCCGCAA 

77*TAVNSEGN-Rt RNHLLPI IGHLTLOELOGQVTQQwvndle 

Wi$ GGCCGGCGTCGOrCCGTGGC^^ 

,Til5> AGVGPWPESTRGRRKPLAAKTl SNCHGLLHTt CGAAIA 

"1654 CKL^CCX^TCAGGCTCAACCCGTGCT^^ 

"4.154*A KRI RL NPCSSTML PRREPKEMKFL SDPE1 GRL f T A L P P 

hlO CACTGGCGACCXKTTCGTCATGCTGCTGGTO 
f|193> HWRPLVML L V A T G L RWGEA I G L RAGRVDLLAARPRLTVV 

.jl086 OSAGCAGCTCCAGGAGCTGGCCAGCACC^ 

231> EQLQELASTGELVFQSPKTAKGRRTVSFTTKVALLLTP 

- = 2202 TCATCGCCGGAAAGAAAAC?I^ACGAGGTCG^ 

^ 270H. I AGKKSOEVVFTAPKGGMVRTRNFRR1 WVKACEEAGLP 

251 8 GCX7ITACGCATTCACGATCTGCGGCACACTC 
= 309* GLRI HDLRHTKAAi Li SAGRPLSAI SRRLGHSSIAVTDL 

2434 GCTGTACGGGCACtrrTSCGT^ 

" 34?» LYGHL R E E V D E G I L A A I EEAMAGVRA EOL EA EL DEEL T 

2550 MXnGVTGQCXGPiCGC^^ 
; 386>D V L A D A A • 

2665 TGACCgX7IXraX*GQCXXXCGCT^ 



2768 ATGCCTCATAGCXSC^UZAGCTA^ 



2884 GCGAiXTGATCGCGCAGCTGCGCCTCC^^ 

3232 ATOCXnCCGTCTCCCATCAQCGAOOC^^ 

3348 CCCGTKXSACTGGTGACCGAAAGC^ 

J464 TGGGCQCCGGTIOtSACAATGACOTAGA^ 

J 580 AACCCGGATCAGGOGCGCGGAATGCGCADGCCTACCTCATCCG 

i696 CtXrrCTX^rCCTCCXXXXnGGGCATCC^ 

J812 CGWTTCCCTCAGCW^XAGeTCTW^^ 

19 2 8 TCCTGGATGCGGGCGCGCAGCAGCG^ 

1044 CGTTCGAACGTTCGGTQCSCTCTA^ 

1160 TSACCGAGTACAGATaSGCC^GCTCr^^ 

i276 ATGGGCATKXXWATCTCCCrnxrr^^ 



Figure 7B 



pSPRH826b Insertion plasmid 
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1 GGTACCCGACCGTGTCCCGGAACAACGAGTCGAGATACGGCGAGAGGAACACCCCCC(^TAGTCCGGGTAGACGGTGGGCGCGAAGGCGTAC 
93 GCGCCTTCGACGGTCAGCGGGCGGG^ 

<• STVHVRPVYL I WQGGA 
184 CGGCGGAACTCCTGCTCCTTCGCCATGATCTCGTCGGCGTGGTTC 

<R RFEQEKAMI EDAHNWAFLLAYDVADPTFAD 
27 6 CGGGGTGCGC AC CGGG ATGTGCGTGCCGGGGGTG AGCCGGC C CTGCTTGGCCGGCGTCG 

<PTRVP I HTGPTLRGQKAPTTDCVWSVLDPG I 
3 68 TGCCGCAGAAGTTCGTCACGGTGGCGCTCTTCGCCGT^ 

<GCFNTVTASKATAGYAVVRKGEAKLSNLLA 
460 AGCAGGTCGGTGCGGATGCCCTCGACGTCGGCGGCGAACCTGTCGAGCCGGCTGCGGTCGGCGACCCCTCGGGCGTCCTC 

<L LDTRIGEVDAAFRDLRSRDAVGRADEEG I L 
552 CGCGGCCACCCGCTCGGCCGGCTGCCGCGCGCCGCCACGGGCGATGGTGTAGCGGACCTCCCCACCGTGCACCGGAAG^ 

<AAVREAPQRAGARAITYRVEGGHVPLREVDV 
644 CGAGCGCGAAGCCGAAGCGGGCCGCCAGCGCCTGCACCGACCGCGCCGAGAAGAAGAAGAAATGCTCGTCGTAGATCTGGTCGAAGGACGTC 

<LAFGFRAALAQVSRASFFFFHEDY IQDFST 
736 TTGTCCAGGATGTCCCCGAGGTACGGGTCCTCGAAGACGAAGACCCCGTCCGGTGCCAGCAGCGCGTCAACGCCCCGGAGGATGGAGTCGAG 

<K DLI DGLYPDEFVFVGDPALLADVGRLISDL 
82 8 GTAGGGGATGTGGCAGATGGTGTTGGCCGCGAAG ATCACATCGGCGGGGCCGTCGGTCTCCCTGACCCGGCGGGCGGTGGACTCCTCGAAGA 

<Y P I HC ITNAAF I VDAPGDTERVRRATS EEFF 
920 ACTCGGTGACCACCCGCACCCCGTGCCCACGGGCC ACATCGGCC ACCCCACCCGACGGCTCGAAGCCC AGGTGCCGC ACTCCCGCCTCGTGG 

< ETVVRVGHGRAVDAVGGSPEFGLHRVGAEH 
1012 ACGGTACGCAGCATCACCCCGTCGTTGCAGCCGATCTCCACCACGAACGGGTCCGGGCCGGTG^ 

<VTRLMVGDNCG I EVVFPDPGTAEHELLHRAT 
1104 GTCGGCGAAGTGCTTCTGCATCACGGAGGAGCCCGACGAGTGGTACGGGT 

<DAFHKQMVSSGSSHYPYDQHFMEERPVEEML 
1196 GCTCCACCATCGTGCAGCCCGCGCAGACCCCCACCGCCAGGTGGAAGAAGAACT 

<QVMTCGACVGVALHFFFEDALQDETLFRDS 

12 88 AGCGGCTGCCGACCCAGGTCGAGGAACTGGGCGGTGGTGCCGCCAC ACGCCCGGCAGGTGCCGCGCCGGGGCGTGCCGTCGGCTGAATCGGT 

<L PQRGLDLFQATTGGCARCTGRRPTGDASDT 

13 80 CATCGCATTCCTTCCATGGATACCCCTGCCTCAGGCAGGGCGGATGTCAACGACGTCTCCTTGTCGGGATGGGTCGTCC AGCTCTCGGC 

< M 

14 6 9 TGCGGCGTGC CGG C CGGC G ACTC AG AGCGC CGCG AG C ACGTCGCG C AG CG CG TCGATC A C CC GGTCCTGCG TC TCCGGCGG C AGCG AGGG A 

LAALVDRLADIVRDQTEPPLSP 
1560 TACATGGGGAGXSGAGAAGATCTCGCCGGCCAACCGTTCCGTGGTG^ 

<YMPLSF I EGALRETTPLSGTQYGLKAFGTMT 
1652 GTGCACCGGCCAGGGGTAGCTGATGTTGAGCGCGATGTCCCGCGCCGCCAACTGTTCGA 

<HVPWPYSINLAIDRAALQELIRDRAPHRVVY 
17 44 AGACGTACCAGACGTGTTCGTTGTCGGTCGCCGTCACCGGCAGGGTCAGGCCG ACCTGGTCGGCGATGTCGGCC AGGCCCTCCTCGTAGCGC 

<VYWVHENDTATVPLTLGVQDAIDALGEEYR 
183 6 CGGGCCACGGCGCGGCGCCCGGCGATGTAGTCGTCGAGCCGGCACAGCTTGCGGCGCAGGATCTCGGCCTGCACCTCGTCGAGGCGGCAGTT 

<R AVARRGAI YDDLRCLKRRL I EAQVEDLRCN 
1928 GTGCCCCGGCGTCTCGACGACGTAGTAGACCTGTTCC ATGCCGTAGT AGCGC AGCCGCCGCAGCCGCTCGTCGATC ACGGCGTCCGCGGTG A 

<HGPTEVVYYVQEMGYYRLRRLR£DIVADATV 
2020 CCACGCCGCCGCCGTCGCCGTACGCGCCCAGC ACCTTCGTCGGATAGAACGAGAAGGCAGCGGCGTCGCCCATCGTGCCGGCC AGCCGTCCG 

<VGGGDGYAGLVKTPYFSFAAADGMTGALRG 
2112 CCCCGACGCGCGCCGTGCGACTGCGCGCAGTCCTCCAGG ACCTTGAGGCCGTGTTCCCGGGCCACCCGCAGCACCGGGTCC ATGTCGACGCA 

<G RRAGHSQACDELVKLGHERAVRLVPDMDVC 
2204 CTGGCCGTAGAGGTGCACCGGCAGCAGCGCCTTCGTCCGGG^GGTGACCGCCTCGGCCAGCAGCT^ 

<QGYLHVPLLAKTRPTVAEALLETDMLYDDAR 

22 9 6 GGACGTCCACGAAGACCGGCGTCGCGCCG ACCGCGTCG ATGGCGAGC ACCGTCGGCGCGGCCGTGTTGG AG ACGGTG ATG ACCTCGTCGCCC 

<VDVFVPTAGVAD IALVTPAATNSVT IVEDG 

23 8 8 GGCCCGACGTCGAGCGCCTGX3AGTGCGAGCTTGATG 

<P GVDLAQLALK I ANTGNDVTVCH PMDHYAAF 
2480 CTCCTGKTTCGAAGCCGCGCACGCTCGCGCCGAGGATGAGGTTCCCGGACT^ 

<EQEFGRVSAGL I LNGSEFVTQVADLLDDREK 
2572 TCTCGTACTCCGGCAGGTAGCCCCACACTCGGATGGTCATCTTCGCCCTTCTACGCC 

<EYEPLYGWVRITM <• ASTRSRLARVSM 

2661 GTAGTeGTTGTCCCGGTCGAGTCCAAGTGCCTGACCGCTCAGGT^ 

<YDNDRDLG LAQGS LYDVADVY S Y PQMFGGAR 
2753 GCACGTCGCGATAGAGCaXXSAGAGTGGGTGCCCGGCCGTGTAGGCGAGGCCGCC^ 

<VDRYLRSLPHGATYALGGVLSLCDDVVAPA 
2845 AGCTCGTTGACGGTCATCTTCGCGTACTGG AACGGGG^ 

<L ENVTMKAYQFPTMMRRGREDPDGSLDVSAA 
2937 G^TCGGCGTTGGTCAACGCC<3CGCCGAC CGTGGTGCGT AGCGCGT AG AGCCGGGTGTCC AGCC CGGCG ACC AGC G CC C GGGC ACCGGCCCGTG 

<DANTLAAGVTT RLAY LRTDLGAVLARAGAR P 
3029 GCTCGCCGCCGCGCCCCGCGCAGAAACCGACCGCGATGTCCCGGGCCGCCTGGGCG^ 

< EGGRGACFGVA IDRAAQAIGAY IGLMTISS 
3121 ACCGTCTGC CCGGCC AGC ACGGCGTCCCGCCGGGCC CCG ACCGGGCCGCGCTCC AGC AGCTCGTCGGC C CGGAC CGGGC ACCGGTCGAAGA C 

<VTQGALVADRRAGVPGRELLEDARVPCRDFV 
3213 CACCTCCAGCGTCCCCGAG<^ACGCATTCCCAGGCCGTCCCAG 

<VELTGSARMGLGDWNDLVTLGPADRHVVPVA 
3305 CGAGGAACACCGAGCCGTCGTCGTCGXrGCCGCTGGGCGTGC ACGAAGAAGTGGGTCGCGATGGGCGCCATGCTGACCAGCACCTTGCGGCCC 

<LFVSGDDDRRQAKVFFHTAI PAMSVLVKRG 
3397 GACAGCAGCCAGCCGCCGX3CGCCGTCGGAATGCAGCTCGGTGACCACGCCGGGGGCGTCCTTGAGCGCGC 

<SLLWGGAGDSHLETVVGPADKLAGCVAAEG£ 
3489 CGCCATCGCCCGCAGCAGCCGCTCCGCCATCGCGCGC^ 

<AMARLLREAMARVPPTGHQWEYTLTLGRSLQ 
3 581 GCACGTGCCAGGCCAGCGCGGTGCACtSCGTCGGCCTCGGCCAGCCGCATC^ 

<VHWALATSADAEALRMLATAVDYLRTLGLG 
3 67 3 CCCAGCTCGGCGGGAACGGTGGCGCCCATCAACCCGAGCTTCGCGAACTOC^^ 

<G LEAPVTAGMLGLKAFQEFAEVPFTGTRDRD 
37 65 GGCGGCCTCCGCACTGATTCTCGGGATCACGCCGGCAAGCAGGTCGACCACCGTGCGCCCCGCCGGCGTGAGCGGCGCGCGAAGATCCGCCG 



Figure 11 A 



<AAEASIRPIVGALLDVVTRGAPTLPARLDAA 
3857 CCACCCATCTCTCCTCGGTCAGATTAGAC^^ 

3947 < AATTGCTGGCTGATTCTCCACCGAC^^ 

4039 TCGC^AACTCTAGTTGGAT 

till TGCGGAGGACT^GTGAAG^ 

>VK I LFIAGPTKSSLFGLAPLAIAARM 

4221 AGCGGGCACGAGGTCGTGATC^CTTCCACGCAGGAGGTCGTACCGGCGACGATGTCCGTCGG^ 

>SGHEVVMASTQEVVPATMSVGLPAFPLAALT 

4313 CCTCGCCGAGCTCA^^ 

>laelmttdragdplripaedaafvpfvgrm 

4405 tcggccggctggcggcgatcagcctggatccgctgcck^ 

>fgrlaaisldplrdlvggwrpdlivggphay 

4497 GCCGCGCCGATCCraCCACCGAACTTC^^ 

>AAPILATELGVPCVRHLLTGNPVDREGTHPG 

589 GGTCGACGAGGAGCTGCGGCCGGAGCTGGCCGCGCTCGGCCTGGCCCAGGTGCCGCCGTTCCACCTC^ 

>VDEELRPELAALGLAQVPPFHLALDIFPAS 
4681 CCCGGATCGACGACGTCCCGCCGGCGCAGCCGGTGCGACCGCTGCGCTGGATTCCGACCAACCAGCAGCAGCCGGTGGCGCCGTGGATGCTC 

>T RIDDVPPAQPVRPLRWI PTNQQQPVAPWML 
4773 TCCXTGCGGGCCGCGTCGCCGTGTCCT^TCACCGCCGGCAGTCTGGTCACCACCACCCACAACTTCGACTTCCTC 

>SRGPRRRVLVTAGSLVTTTHNFDFLHGLAGT 
48 6 5 CCTGGCCGAGC AGGACGTCGAGGTCGTGGTCGCGGCGCCGCCGGAGGTGGGTCGGGCCCTGCACGACGTGCCGGGTGTGCGGCACGCCGGCT 

>LAEODVEVVVAAPPEVGRALHDVPGVRHAG 
4957 GGCTCCCGCTGGACGTXjGTGCTGCCCCACT^TGACCTGATCGTGCACCACTCCGGCACGATGACCGCGCTGACCGCCTTGAACGCGGGGGTG 
>WLPLDVVLPHCDLIVHHSGTMTALTALNAGV 
CCCCAGCTGATCGTGCCGCAGGAGAGCCGGTTCATCGAGTGGGCGCGCAACCTGTCGACCCTGG<3CGTX^CGCAGACCCTCGCGCCGGGCGA 
_ -,.„«^^T^r-.T c- t.i r p m T . Q t T.GVAO TL A P G E 



4 



5049 



>POLIVPQESRF I EWARNLSTLGVAQTLAPGE 
5141 GGACACGCCGGAGGCCGTGGGCAAGGTCGCCCGCCTGCTGCTC^AGGATCCGGTCCACGCCACCAG^ 

>DTPEAVGKVARLLLEDPVHATSAAAIAREI 
5233 CCGAGATGCCCGGCCCCACGGAGGTCGTGGGCCAGCTCACCGAGTTCGCGACCCGGGGCCTGACATGCGCGTCCTCGTGACCGGCGG AGCC 

>A EMPG PTEVVGQLT E FATRGLTCAS S • 

> V T G G A 

5324 GGGTTCATCGGCTCCCACCTCACCGACGCGCTGCTCGAACGCGGCGACAGCGTCACCGTGCTCGACGACCTGTCCACCGGGCGGCCCGAGCG 
>GF IGSHLTDALLERGDSVTVLDDLSTGRPER 

5416 GCTGCCCGCCGGGGTGCCGCTGCACCACGGGTCGATCACCGACCGGGCCGGGTTGACCCGGCTGGCCGAGCAGTGTCGCCCGGAGGTCATCT 

>T J PAGVPLHHGS ITDRAGLTRLAEQCRPEVI 

5508 GCCACCTGGCCGCCCAGGCC^ACGTGCGCAACTCGGTCGCCGACGCCA^ 

>rHLAAOADVRNSVADATSDTGVNVVGTVNVL 

5600 GAGGCCGCCCGGGCCATCGACGCCC^ 

>FAARAIDARVVFASSGGALYGEVDELPSFi^u 

5692 CGTCCGGCCGGCGCCGTOGGCGCCGTACGGGGCCGCCAAGTACTCCGCGGAGCAGTACC 

*\7RPAPWAPYGAAKYCAEQYLALYNRLYGS l 

5784 ACGCGGCGCTGCGGCTCGGCAACGTGTACGGGCCACGCCAGGACCCGACCGGCGAGGCCGGG^ 

vh&aLRLGNVYGPRQDPTGEAGVVSIFCGCLV 

5876 GCCGGGCGCCGGC CG ACGGTGTTCGGCG ACGGCG AGC AG ACC CGGG AC T AC ATCT ACGTGGC CG ACGTGGTGG AGG CGTTC CTGC TCGCGGT 
>AGRRPTVFGDGEQTRDYIYVADVVEAFLLAV 

5968 CGGGCACGGTGGCCCCGGCCTGTGGAACATCCGCACCGGGACCTCCACCAGCATCCGC AAACTACTGGACCTGGTCGGCCGCACCGCCGGGC 
>GHGGPGLWNj.GTGTSTS I RKLLDLVGRTAG 

6060 gcgtcccggacccccgcttcgagcca^ 

>R vPDPRFEPPRLGELKHSALEVTRAARELRW 

6152 gcggcccgaacgaggctcgccgacggcatcgcg aaggtctacaagtgggtcg aggcggacgaaccggtccggggggagcgatgacccgcg 

> M T R 

>AARTRLADGIAKVYKWVEADEPVRGER* 

6242 aggggtcj^cgccgccggttagggt<xx:cac^ 

>EGSTPPVRVATI TVGTNE I RWLDRALGS LL A 



-EG ST P PVRVAT I T V G T N & l ^w^^^^^w^ rmn ^ 
6334 AGCGACACGACCGGCTTCGAGCTGACGGTCTTCTACGTGGACAACGCCTC 

>SDTTGFELTVFYVDNASADGSVAHVMSAFPG 

6426 CGTCCGGGTCATCCGAAACCCCCGCAATCTCGGCTTCACCGGCGCGAA 

>VRVI RNPRNLG FTGANNVGMRAALAEG F D H 
6518 T<TITCCT<^TCAACCCGGACACC^^ 

>iflvnpdtwtppglvrglvefaqrwpqygvi 



-IFLVNPDTWTPPGLVRGLVfc.tAvnwr-v * ^ v * 

6610 GGCCCGTTGCAGTACCGCTACGACCCGGCGTCGAC 

>GPLQYRYDPASTELTDFNDWTQVALYLGEQH 

6702 caccttcgccc^cgacctcctggatcatccc^ 

>TFAGDLLDHPSHVTATVRDRAPRTLEHAYV 

67 94 agggctcggcgctcttwtccgggccxjccgtgctacgcgaggtc 

>OGSALFVRAAVLREVGLLDEVFHTYYEEVDL 

6886 TGCCGGCGGGCCCGGTGGGOXXXrrGGCG^ 

>CRRARWAGWRVALLLDLGIQHKGGGGTAASA 

6978 GTACAGCCGGATACACATGCGCCGCAACCGCTACTACTATCTGCTGACCGATGTGGA 

>YSRIHMRRNRYYYLLTDVDWPPAKAARLAA 

7070 GCTGGCTGTTCTCCGACGTCCGTGGGCG^ 

>R WLFSDVRGRGVTGRTSAGVGARETFVALGW 

7162 CTX^CCCG^CAGGCCCCGX3TGATCCGX5GAACGTCGT 

>LARQAPVIRERRRRHRLLRARGTGVDRARER 

72 54 GAAGGAAACCGTGCGGGGATGAGCAGGCCA03GATTCTCGTC^ 

> K E T V R G • _ . _ _ 

>MSRPR I LVAGNFHWQAGFSQTVAA 
7345 ACGTGCGGGCGGCCCGGGAGGCCGACTGCGAGGTGCGGCTCTGCGGCCCGCTC 

>YVRAAREADCEVRLCGPLSRVDAETARHLPV 

7437 GAGCCGGACCTCCGCTGGGGCACCCACCTGGTC 

>EPDLRWGTHLVIMFEAKQFLTEAQLDLVEAF 

Figure I IB 



7529 CCCCCGACAGCGCCGGGCCATCGTCGACTTCGACGGGC^ 

> PRQRRAIVDFDGHWGAEEGGDGDSASGRYS 
7 621 CGGAGAGTTGGCGGCGGTTGTACTCGACCCTGAGCGACCTGATCCTC 

>AESWRRLYSTLSDLILQ;PRLGPLPAGARFFK 
7713 TGCTTCGGX:CTGX3CAGCGCCGGTGCG<;CACCCGCTGGAACTGGGCACCGG 

>CFGLAAPVRHPLELGTGAQSRPYDLQYIGSN 
7 805 CTGGTGGCGGTGGGAGCCGATGACCGAGATGGTCGAGGC 

>WWRWEPMTEMVEAAAAARPPLRRLRVCGRW 

7897 gggacg^^^agttgcgcgggcttcgaggaggcgacgctc 

>W DGGSCAGFEEATLSEPGWLRARGVEVHPPV 
7 9 8 9 CCGTTCGGCCACGTGGTCGAGCAGATGGGCCGGTCGCTGATCTCACCGGTCCTGGTGCGGCCGCTG^ 

>PFGHVVEQMGRSL I S PVLVRP LVTSTGLLT P 
B 0 8 1 C CGGATGTTCG AG ACGCTGGCCTCGGGC AGCCTGC CCGTGCTCCCGG TCGCCGCGAAG TTCCTCGCGC CGG TCT AC GGCG AC G AGGC GG AAC 

>RMFETLASGSLPVLPVAAKFLAPVYGDEAE 
817 3 ACCTGATGCTCGGCGACGACCCGGCCGGAACGCTGAGCCGGCTCTCGGCCGAGCACGAACGGTACGGAC^^ 

>HLMLGDDPAGTLSRLSAEHERYGRLVGEIQD 

82 6 5 CGGCTCCGCGTCGAGTACGGCTACCC^CGCGTCCTGCGGGACCTGCTCGATCTGCTGGCCTGAGGAATGAGGAGCAGATGACCCCCCTG 

> M T P L 

>RLRVEYGY PRVLRDLL.DLLA • 

83 54 CGGATCGCGATGGTCAACATACCGTTCCGGTTGCCGAGCGACGAGCGGCAGTGGATCACGGTCCCGCCGCAGGGGTACGGCGGGATCCAGTG 

>RIAMVNI FFRLPSDERQWITVPPQGYGG I QW 
844 6 GATCGTGGCCAACAAGATCAAGGGCCTGCTCGAACTCGGGCACG AGGTGTTCCTGCTCGGTGCCCCGGGCAGTCCGCGT ACGC ATCCACGCC 

> IVANKIKGLLELGHEVFLLGAPGSPRTHPR 
853 8 TGACCGTGGTGCCGGCGGGCGAGCCCGAGGACATCCGGGCATGGTTGAAGTCCGCTCCG<3TGGACGTCGTCAACGACTACAGCTGCGGCAAG 

>LTVVPAGEPEDIRAWLKSAPVDVVNDYSCGK 
8630 GTGGATCCGATCGAGCTGCCCCCGGGGGTCGGCCTGGTGGCCTCGCACCACATGACC ACCCGCCCGTCCTATCCGGCCGGCTGCGTGTACGC 

>VDP I ELPPGVGLVASHHMTTRPSYPAGCVYA 
87 22 CTCGAAGGCGCAGCGGGAGCAGTGCGGCGGCGGCGCGGACGCCCCGGTCATCCCGATCGGGGTGGATCCGTCGCTCTACCGCCCCGGCGACC 

> S K A Q ~ R EQCGGGADAPV I P I GVDP SLYR PGD 
8814 GCAAGGACGACTTCCTGCTCTTCATGGGCCGG ATCTCCCCGTTCAAGGGCGCGCTGGAGGCGGCCGCGTTCGCCCGGGCCGCCGGCCGCCGG 

>R KDDFLLFMGRI SPFKGALEAAAFARAAGRR 
8906 CTACTGATGX?CCGGTCCGXSCCTGG<5AGCCGGAGT^ 

> L L M A G PAWE PEYLDR IMGE YGDHVTLVGEVG 

8998 ggx3tcaggaacgtatggacc^ctcgccacggcggctgccatcctggtgctctcccagccggtgccc 

>gqermdllataaailvlsqpvpgpwggtwc 
9 090 agccgggtgcgaccgtggtgtccg aggcggcggcc agcggc accc c ggtggtcggcac g agc aac ggctg c ctggcgg ag atcgtgcc gg c c 

>e pgatvvseaaasgtpvvgtsngclae ivpa 
9182 gtcggcgaggtggtgggcttcggcaccggcttcgacgagcgggaggcccgagcggtgctgtcccg 

>vgevvgfgtgfderearavlsrlpspaqark 
9274 ggccgcgatccggtgctggx3ggcacgtg<3agatcgcccggcgctacgaggcggtgtaccgcgacgtgctggc 

>aaircwghveiarryeavyrdvlagarws • 
9365 gccggcccgx3gcggcgxk:tacg^tcg<:gacc^ 

94 57 gccaggtcagctccggtcgtgcagccagggtcgcggctcggggtcggtcacctcgaccggctggctcatgaacaccaggacgtacgcgcgg 

<• srdhlwprpepdtvevpqsmfvlvyar 
9548 cgcggctggtccgtctcgttcggxxtcggcgtagtgcggcgcacggaagtc^ 

<rpqdtenpgayhparfdhmvaegprlpcavs 

9640 GTCCGTGTCGTCGACCTCGTCCGTCATCAGAC^TTCGATGCGGT 

<DTDDVEDTMLGEIRDDHNIHHHPLVGGRHLG 
9732 CCGGCAGGTACTGGAGACAGCCGCTGG ACACCGTCGCCTCGTCGAGCGGGGTCC AGATGCTC AGGCCGCG ACGGGACCAGCGCGGGTCCATG 

<PLYQLCGSSVTAEDLPTWISLGRRSWRPDM 
9824 TACGCCTCGTCCTGGTGCCACG<3AGTGGGTGCGCCGTAGCGCG^ 

<Y AEDQHWPTPAGYRPPKLIAHGYFDLEDEPI 
9916 G TC G AGG AAAGC GG AGGC G ATGG C C CC<5 C ATC GCG C GAAGTGCG CGG TCTCC AC CAACTC 

<DLFASAIARCRAFHATEVLEPLYKEPRVIQP 
10008 C^AGACGGGCAGGCGCGGCC^TCGTCGXTCACGGCCGGCGATGTCCCGGTAGTCGCO^ 

< LRAPAADDGRGAI DRYDGTDPSAHDAFLRD 
10100 TAGGCGGCCCGGAGCCAGGCGACCTCGGCGTCGTCGC^GAGCTGCGGGAGAGTCACGA^ 

<YAARLWAVEADDALQPLTVFGDRRYAELRRD 
10192 GACGACCTCCGCACCAACAG^TCCCCACGGCCATTTGACCACCTC 

<VVEAGVTGVAM 
10282 GCGATACCGCTCCCGAGCGGGAAATAGGGATTCGACTAGT^ 
1037 4 CCCCGTGCXiAGACTCC^CTCC^TGTCCT^ 

>MTGHSAVALDVGGV 

10465 CGTCTACTACGACGAGCCGTTCGAGXrTGGCCTGGCTCCAG^ 

>VYYDEPFELAWLQDTFDRLQATDPTLDLRA 
10557 TTCTGGAGCACGTCGAGCGGTTCTACCACTACG^ 

>FLEKVERFYHYGEGDPTGRTWLHSEAAALSW 
1064 9 TCGCGGGTCCGGCAGTCCTGGGGCGAGCTCGCCCAGGAGATTCCCG^ 

>SRVRQSWG ELAQE I PGAVRAVTRLARELPVV 
10741 GATCGTCGCCAACCAGCCCCCCGAGTGCGCGGACGTACTGGCCCGGTGGCAGGTCAGCC 

> IVANQPPECADVLARWQVSQVCREVLLDSL 
1083 3 TO^GGGTGGCCAAGCCCGACCCGGCCCTGCTCC^CTC^ 

>VGVAKPDPALLGLALRRLAIPPAELLVVGNR 
10925 ACGGATCACGACGTCCTGCCCGCGCTCGGGCTCGGTTGC^ 

>TDHDVLPALGLGCPVAFVLPDPAYRRPPGVH 
11017 TCCGX3ACCTGX3TCCt3GGTCTACACG^ 

> PDLVRVYTELRAFRTGS PPADARVTTVA S L 
11109 CGGCCCTGGCCGACTCTCCCCTGACGAGTGCCACCCCGCG^^ 

>A A L A D S PLTSATPRSNAGTGGL- 
112 00 CCGCAGCGGTGGTCGGCGCCACCGGCTTCATCGGCTCAC 
11291 TGCCGCCCCACCCGTCGTCGACGGCC^CCCGC^ 
11383 TGGCGGAGCAGCAACCGGAACGGGTC<3TCCGGGAACGCGAGTTC 
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11475 TTCGTCCTGGCCAGCTCGGGCGGGGCGGTGTACACGCCGACGGTGTGGCCGCCCTACCACGAGCGGTCGGCCACC 
11567 CGGCCGGGCGAAGCTGCGGCTGGAACAGGAGCTGCTGCGCCACACCGACCGGGTGCAGCCGGT^ 
11659 CGGGGCAGCGGCCGACCCCCGGGTACGGTGTCCTGTCA 

117 51 GCCGTGGTGCGGGACTACGTACACGTCGACGACGTCACCGCGATCATGGAGGTC 

11843 GCCCACGX3TCGTGAACGTCGGCTCGGGCCTGCCCACCTCCCTCGCCGAGTTGCTCCAGACGATGTC 

11935 tcatccgggacgtccgccggcagttcgaccatcggg<x:aactggctcg ACACCACCCTCGCCCGGGAGACCCTGGGCTGGCAGGCGCGGATC 
12 027 AGCCTCCCCGACGGCGTCCGCCAGTGCTGG<;AGGCCGTCCTC^^ 

>SLPDGVRQCWEAVLTRAGGPGGS PARPSAR 

< - G P 

12118 TCGGGAGAGCGTCTCG<KX^GXX3AACCGCCGCAACCGCGCCCTTC 

>LGRASRGREPPQPRPSQQFVAQPGGGRRGVA 
<EPSRRPPPFRRLRARRLLEHSLGAATAATYG 

^unct ion mark 

12210 G AGGGCCAGTGGC AGGGCG AGC CGGGATTCCGGGGC CG T AC GGTGC C GGTCCC ATTCCTTGCGC AGG C CGG C CC GTG C CTGGTTC C GT C CGG 
>EGQWQGEPGFRGRTVPVPFLAQAGPCLVPSG 
<LALPLALRSEPATRHRDWEKRLGARAQNRGA 

123 02 CCTCGCACCGCCCCTGCCAGTACGCCCGCCGCAGCAGGTACCGGGGGGTCAGCCGGCCCGGGTCGATGTCATGGGTGACC 
>LAPPLPVRPPQQVPGGQPARVDVMGD 
<ECRGQWYARRLLVR PTLRGPDIDHTV 

12382 GCGTGGTCCGGG AGC AGTTGCTCGCGGGCGCCGGCGGCCTTC ATGGCGCTGATG AAGGAGGTGTCCTCC 
>RVVREQLLAGAGGLHGAD£GGVL 
<AHDPLLQERAGAAKMAS I FSTDE 

12451 CCTGACTGGAGGTTGCCCCCGGTACGGCTGAGGGCCAGGTCGAAATCCAACCCGTGGGCGTGCGCGAAC 
>P • 

<GSQLNGGTRSLALDFDLGHAHAF 
12520 GCGGAGTCCACCCCCATGCACGCGCCCCAGATCTTGATGTTT 

<ASDVGMCAGWI K I NGQDRHWGVLHFQGS 

junction marker 

12603 ggtgacgtaccacggcagacgcagcgcggggcgggccagccgggtgccgacc acgtgcgcgccgtcgcgcaggctcttgcggacggcgtcga 

<tvywplrlapralrtgvvhagdrlskrvadv 
1269 5 cggcagcggcgtcgagccgcacgtcgtcgtcgacgaacatcagatggtggtgcggcc agcgggcgagc atcgcgttgcgggaggccg acagg 

<aaadlrvdddvfmlhkhpwralmanrsasl 
127 87 ccattggtggcaccgaggatgcgcatggtgccgccggcggcccggacctcctcckk:gacctcctccgcctcggccgtgacgggccggtccag 

<gntaglirmtggaarveeave£aeatvprdl 
12 879 caggacgtagtactcgtcgccggagagctgggcc atgttgtgcgcgaggtgtttcctgacgttctccacccggaacgcgcagatcgcc acca 

<lvyyedgslqamnhalhkrvn£vrfaciavv 
12971 ccatcgggtggtcggacg<5atcgcggctgaccacagaggcgttgttcggcatcgtccctcc^ 

<MPHDSPDRSVVSANNPM 
13061 GGCGGCCCACCGGATCGCGCTCCGCGGCAGCGTGCCGCGGCCCCGGCCGCCCC ACGGACAGGTGGCGCACCGCAGCGCGGCGGGCCGTCCGG 
13153 TaAGCGGCCG^GG<K:GX3AGCTCAGX3CCGCTTCGTCCTTGCTGACGGTCC^ CTT 
13245 CGCGATCAGXSGCCGGCCGGTTGXSGGACTCGCAGGTTCTGCAAC^ 
13337 TGGCGATATGCACCGTCCGTTCGCCTGCGGCGATGCT^^ 
13429 CCTTGATTGTGGATCTGTTGGCTCGTACGCGGCCAGACGTC 

13521 GGAAGACCGTCAGCGGCGGAAGGCCGGGTGCCCGCC ACTCGTC GTCCTC^GCTTGOGGAGCl^TCTCAGTGTGCGGCGATGCCCCCGGCACG 
13613 TAGCCCGCCTGACGGGCCTGTCGGCTTGGGCTCCAACTTCGGCAACCCATCGGTT^ 
13705 GAAGCCCTTCGCTACTGTCGATCGACATCACTGGCGAAATGAAACCGACCGGAT^ 

13797 CTGTCGCTCGCGCTGAACTCACCAATACGCC? ^AAGCGTAGCCGGCCCACriXK:G<5AGCGTCCACCCCCCGAGGATATCGCCAGGCTTCCATG 
13889 CAGAACTGGCAGGATCTTTCATCTCAGCCGCACCTGGCGACAAAACCCCT^ 
13981 GACATGTGTCACTTTAGACAACCCAGCTCCAGCCAGGCCCACCACC 

14073 CTAGX3GGATTCCTTAGTCTGGCGAGCGTCGATCCCCCACCCGTC AATCAATTGCCCG AAC AAATTGCCCACCCGTCAAGATCAACCACTCCG 
14165 CACCGTGAGCAGGCCGACCGTCCGGC^TGCGAACGACGACGTCCACCCCCOK:CC^ 
14257 CACCGTCCGGCGTCGTGATCAGCG<OTCGCTCGCCGGCTGTCGC^ 
14349 GCGACCTGTCAGCCCTGCAAGCG<3ACCGGCATCGCGCGCGGGGCC 

< • RRSRD PLRS 

14440 CGCGGCCTCGTCCCAGAGCCCGCCCACCGCATCCCGCAGCGATCGTCGGCGCC 

<AAEDWLGGVADRLSRRPRWGLLDRAPGPDVR 
14532 GGGCCCAGTCGACGACCTCCACCGAGCCGGGCCGATCCGGCAACTCCACCACCTC^^ 

<AWDVVEVSGPRDPLEVVETPVGSVEILMEV 
14624 AGGGAGCGGACCGGAACGGCCTCCCCCCGCCCCACGCCG ATC ACCCGCCCGGTGAC<^ACTCCCGCGTGGCCGAC ACC ACCGCCTCGGCGAC 

<L SRVPVAEGRGVG IVRGTVSERTASVVAEAV 
1 47 1 6 Gl<!CCGCACGTCGACGTAGTCCCGGTGGGCGCGCAACGGAGACAACn:CACCCTGG^ 

<DRVDVYDRHARL P S LEVRASRDRGAADVLRV 
14808 CGACGACCCGCCCC^CAG<^TGTCGGGTGGCACGCCCG^ 

<VVRGLLSDP PVGPGVVNALRLVTADVSGAR 
14900 GTCGCCGCGAGCACGGCCTGXSGTGGCGGCCAGCITC^ 

<TAALVAQTAALKARGYMSEPQTPVTAGAPAG 
14992 GGGTGGCTCCTGGACGCGTTCCAGGACCGAGCCGAGGTGCACCAGCCTGGGCC 

<PPEQVRELVSGLHVLRPRCRTRELAATVSWT 
15084 TCGGGATCGTGCAGCTCGACGGC AGCTCGGCCTCCGTC AG^ 

< P ITCSSPLEADTLNWKGGTANVVADPQEAD 
15176 AACACGGCGGCCAGCGCGGCGWCTCCAGGGTWCGACGTCCAGCG^^ 

<F VAALAAPELTAVDLARARYPLGAS PVRRAL 
15268 CACGAC^ACGTCGTCGCCCCG<X^GGCAAGCGCGGCGCTCACGTGACGCCCG 

<VLVDDGRAALAASVHRGVFGTGGVVVVRRAG 
153 60 CCATCCGTACCTCCTG<]K3GATCAGTCTCGTG^ 

<M <• DRAGAADLRGGPGQCTVPARLV 

15449 CCTGGCCCGGTCGCGAACCTCGTCCAGCAGCCGGGCCCGGGCGC^ 

<RARDRVEDLLRARAR I ATVEEAPQGATVARV 
15541 CGAACTCTCGCATCGTGTTGACGAACTCGTCCTCGGC^^ 

<FERMTNVFQ_DEAPFTLERTEDQREVRVVPH 
1563 3 CAGGCCGGCGGTGGGGTGTACGCCCGGTCGACGACGATCC 

<W APPPTYARDVV IRGASGWLQYECRYSHEFG 
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15725 GAAGGCGATCTGTGCGGTCCGCCCGTCCGGCGTACACAA^ 

<FAIQATRGDPTCLLAAGSVDVGRDPDERLTA 
15817 CCGCCACCACCTCCGGCTCCTCCGGCAGGAAGAACCGGGCCGCGCCCAACGGGTAGACGCCCAGATCCAGCAGCGCTCCGCCACCCAGCTCG 

<AVVEPEEPLFFRAAGLPYVGLDLLAGGGLE 
15909 GGTCGGTAACGGATGTCGCCCGCAGGCAGCGGCGGAAACCCGAAGACGCCGGAGACCATCCGCAGCTCGCCGATCTCGCCGGCCGCGACCAT 

<P RYRIDGAPLPPFGFVGSVMRLEG I EGAAVM 
16001 CCGGCGCACGAAGTGGTGCACCCCATGTCGGAGGAAGGTGAGGTTGTCCATC AACACGAGCCCACGCGACCGGGCCTGGGTCAGCACCGCCG 

<RRVFHHVGHRLFTLNDMLVLGRSRAQTLVAA 
16093 CGGTGTCGACC AGCCGGGTGGTCAGCGGCTTCTCC ACCAGC ACGTGTTTGCCCGCGGCC AGGGCGCGTTCGATCC AGGTGTGGTGC AGCCCG 

<TDVLRTTLPKEVLVHKGAALAREIWTHHLG 
16185 GTCGGCAGCGGAATGTAGACGGCATCGATGTCCGGGCGGTCGAGAACGGACTGGTAGCCCTCGGCGGCGGCGCACCCGAACTCCGCGGCGAA 

<T PLPI YVADIDPRDLVSQYGEAAACGFEAAF 
16277 GGCGCGCGCCTTGGCCAGTTCCCGCGCCGCGACCACCACGAGCTCCGC 

<ARAKALERAAVVVLEAEPVRRIAPLARRRAI 
16369 TGTCGGCGCAGCCGAGAACCCCGATGCGGACCGTCATCTCCGCCATCGGGTCACCACAGGCTGCGCAGGCAGGCCAGCAAGCTGCGGGCC 

<DACGLVG IRVTMEAM <• WLSRLCALLSRA 
16459 TCGATGTTGAGGTAGTAGCCGTGCCGGAGCAGGGCCTCCAGCTGGCGCACCGTCACCCAGCAGAACTCGTCCGGCACCTCGGTCGGGAAGTC 

<EINLYYGHRLLAELQRVTVWCFEDPV£TPFD 
16551 GTCGCCGGCGTCGACCAGGAGGTAACGGTTCTCCGACCGGTAGAATCGCCCGCCCTCCTCGGTGAGC ACGGTGTCGTAG AGG ACACGCTCGG 

<DGADVLLYRNESRYFRGGEETLVTDYLVREP 
1664 3 GGGCGGCTTCCAGCACCTCGGCCAG^AACAGTGGTCOGGGGCCGG^ 

<AAELVEALFLPRPGFC;NDPICQVTPGMEMA 
16735 TCGAGCAGCCCCGCCTGGTAGCGCGCGTGC ACCAGC AGGTGCGCCACTCCGTCGATTTCCTTGACCAGGAAGGCG ACCACGCCCCGGTGCCG 

<D LLGAQYRAHVLLHAVGDIEKVLFAVVGRHR 
16827 CG<5ATACAACAGCG<;CTGACTCCACCCGGTCACCTCGCGGTTGTCGATGCGCACCGTGACGCCGACCACGGAGAAGTGCCGGCCGTCGTCCC 

<PYLLPQSWGTVERND I RVTVGVVSFKRGDDR 
16919 GCGCGATCCCGTCCGCGGTGTGCCGCCAGTCCGGC AGCCCTCGTAGCGGCACCCGCCGCACGGTCATCTCGTGCCGGCCCTTGGCCCCGGTG 

<AIGDATHRWDPLGRLPVRRVTMEHRGKAGT 
17011 AACCAGCTCAACACCGACACCAGGTCGTGGCGGCCGGGGGCGCCCGCCGCCGAGCGCACGATCGCCGCCACCGCCGGCGACGACGGCGCCTC 

<F WSLVSVLDHRGPAGAASRVIAAVAPSSPAE 
17103 GGTCTCCTGGGCCGCGGCGTAGAAGGCCG AGGGCAGACATGACAGCACCGTACGCGTGTCCATGTTGACC AGGCCGTCC ACCCGC AGCAGCG 

<TEQAAAYFAS P LC SLVTRTDMNVLG DVRLLA 
17195 CGAGCAGTTCGCGCAGCGGGAGCCACCGGTGGTAGTCGCCGGCCGGCACGTCCTCGTCGACCTGGACCACCATGTTCCGGTTGCGCTTGCGC 

<LLERLPLWRHYDGAPVDEDVQVVMNRNRKR 
17287 AGGAACCAGGAGCCCTGCTCCGACTGCAGGACGTCAACCAGCACCCGACCGGCCCCCGGGCGGGTGAAGTAGTCGAGGTACTTCGTGCCGCC 

<LFWSGQESQLVDVLVRGAGPRTFYDLYKTGG 
17379 GCCACGGTGCACCCGGGTGTAGTTGCTCCGGGTGGCCTGCACCGTCGGCGAGAGCTC 

<GRHVRTYNSRTAQVTPSLQMVNINGPEVKAQ 
17471 GGAGCAGGCAGTACGGTGTCCCGTCGACGACCTTG ACGAGCATGCCG AGGATGCCG ATCTCCGGCTGGTTG ATG ATCGGCTGGTGCCATTCG 

<LLCYPTGDVVKVLMGLIGI EPQNI I PQHWE 
17563 CGC ACCGCGCCGTAGGTGGTCTGGACGTGCAGCCCCTCG ATCACGAAG AACCGGCCGCTCTCGTGCCCG AGGTTGCCGGTC ACCGGGTCGAA 

<R VAGYTTQVHLGEIVFFRGS EHGLNGTVPDF 
17655 CGCCCACCCGGGCAGCCGGTCCAGCGGC ACGCGGTCCACCCGGCAGT AGGTCG ACCGGGTCCGCTCGGCG AACC AGG AG AGGAAGTCCGGCC 

<AWGPLRDLPVRDVRCYTSRTREAFWSLFDPR 
17747 GGACCCCCTCGGCGTGCAACGCCGACCACGAGCCGCCGTCGACCGGGCCGGGCCGGTGCGGGCCCGCGTCCCGGGTCAGCAACGGCCCGTCG 

<VGEAHLASWSGGDVPGPRHPGADRTLLPGD 
17839 GCGCGGACCTTCGG^TCGGGCGACGAATCGCT^ACCACAGCTCGGCCAGTTCGGCGGAAAAGGCGTGGGAAGACCCGTCCGTGTCCGCCAA 

<ARVKPDPSSDSV 

<• WLEALEASFAHSSGDTDAL 
17 930 CAGXSCGCAGATCGCTGTCGACCATCATGGCGACCATCTCCTCGAAGGAGACGGAGGGTTTCCAGCCGAGCCGCTGGCGGGCCTTCGTCGGAT 

<LRLDSDVMMAVMEEFSVSPKWGLRQRAKTPD 
18022 CCGXZIG^AGAGCAGCTCGACCTCGXaCGX^CCGXSATGAGCGACTCGTCCACCACCACGTGGTCCCGCCAG 

<ACLLEVEAPRI LSEDVVVHDRWNLGVHAFA 
18114 GCCTCGACCAGCTOGCGGACX5CTGTGCGTGACCCCCGTGCCGAGGA(^ 

<AEVLERVSHTVGTGLVYDEPEDQALMLVMGR 
18206 CACGTAGTCGCCCGCGAAGCCCCAGTCCCGCTCGXiCCGAGAGGTTGCCCAGGCGAAGCGAGCTGCGAATC 

<VYDGAFGWDREASLNGLRLSSRIGLKVAAVG 
18298 CCAGXTGACACCTTGCGGGTGACGAACTCGGGACCACGCACC 

<LSVKRTVFEPGRVPSEKNFLIGSVAYMGYS 
18390 TCACGGTAGTTCTGCACCATGTAATGCCCGAACGCCTTGGCCGC 

<ERYNQVMYHGFAKAAGYPSRPHFPTLENOVP 
18482 CTCCCGCACCTTGCCGAACATCTCCGACGAAGACGCCTGATAA^ 

<ERVKGFMESSSAQYFRPQGAAPSRSDSLGGV 
18574 CGATCCG<1AAGGCTTCGAG<IATGCGGAGCACACC^ 

< IRLAELMRLVGMGTVEATTTSQRWSVPVYS 
18666 AGCGCGCCGAGGTTGTAGACC*rcGTCCG<]KXXrCGCGC^ 

<L AGLNYVEDPAAREIAAVLSTQDLLDGS I L X 
18758 GACCGCTGGATCAGGTTGCCX^AAGGCTGCGGACCGAGGGCGCGGTCTGC^ 

<VAPDPQRLSRVSPATQGRVLGFVEYGSQLLH 
18850 GCTCCGCGAGATACGTGCCGTCCTGGCCGGTAATTCCAGTGATC AGCGCCCGCCGrTCTCAGG^TAGTCTCC AGCCGTGAAGCC ACCTGGCC 

< EALYTGDQGTIGTI LARRTLTTELRSAV 
18941 GAGGCGTGACCTCGCGCCGATGGCGGACCAAAGATCCGCCCGTTCGAAATGGGGTCG^ATCTCC 

19033 CCAAGCGGATTCAGXTGACCCGXSAAAGCAATATAGGGAGG^ 

19125 GGATCGCCCCGTTCGGCCGGCGGACCCATCCCGGACACCTTCCACCGAAGCTCTTCGGGATCGTCGCCCGCCGATGCGA^ 
19217 ACCGTTTTGTCCCCCTAACGTCGGCGAGGCTGCCAGCCGGCCeGCGAGCCGG^ 
19309 GGCTC ATGTCGCAGAGCCG<X:CCGCGXK:CGCAGCACCCCCAGCCCGCC^ 

>MVALVAVM I 

19400 CCCGATGGTCCTGGCCACCCTCGAC AAC ACCATCATCGGCAC^ 

>PMVLATLDNTI IGTALPTVVGELGGLSTLS 
19492 GG<jTGATCACCTCGTACACGCTGX3CCACGX3CCGCCTCCAC^ 

>W VITSYTLATAAST PVWGKLADMYGGKVVFV 
19584 GCCACGCTX^TCGTGTTCCTGGCCGGGTCGCT 



Figure HE 



>ATLVVFLAGSLLSGMAQSITQLTVFRAVHGL 
19676 CGGCGCGGGCGGCCTTGATGGTCTGCGCGTTCGCCATCATGGTGGAGGT^ 

>GAGGLMVCAFA IMVEVLAGPDLPKYQG IMS 
19768 CGACCATGGGCCTGACC ATGGTGGCGGGCCCGCTCGTCGGCGGCCTGATCACCGATGAGCTCGGCTGGCGCTG 

>A TMGLTMVAGPLVGGL ITDELGWRWCFY I NL 
19860 CCGATCGGGGCGGTCGCGCTGCTCATCGTGGTGCTGATGAT^ 

>PIGAVALLIVVLMMHLPRRHTKARIDYAGAA 
19952 CCTGCTC ACCGTGGTCAGTTCGTGCGTCGTGCTGGTGACC ACCTGGGGCGGCATCACCTACCCCTGGGCGTCTCCGATG ATCCTGGGGCTGG 

>LLTVVSSCVVLVTTWGGITYPWASPMILGL 
20044 TCGCGCTCGGGGTGCTGACCTGCGCGCTCTTCGTGGTGGTCGAGCGACGGGTGGCCGAGCCGTTGGTG 

>V ALGVLTCALFVVVERRVAEPLVPLAMFRSL 
2 013 6 AACTTCACCCTGAGCACCCTCATCGCCTTCCTGGTCGGC 

>NFTLSTLIAFLVGFALIAGLTFLALFCQAV r Q 
20228 GGGTGCCTCCGCGTCCGACTCCGGCCTGTTGCTGCTGCCCCTGCTGCTGTCCATGGCGGCGGTCAACGTGGTCGGG 

>GASASDSGLLLLPLLLSMAAVNVVGGRLMS 
203 20 GCGGGCGTTCCTACCGGCTGCTGATGCTCGCCGGTGCGGCGCTGATGACCCTGAGCCTGCTGCTCTTCGCCCTG ATGGACGTGGGC ACC AGC 

>G GRSYRLLMLAGAALMTLSLL-LFALMDVGTS 
20412 CGGACGGTCACCGCGATCCCCATGGTCGGCTTCGGCGCAGGGCT^ 

>RTVTAIPMVGFGAGLGLLMQTSLMVALSSVE 
20504 GATGAGGAACCTCG^GTGGCCGCCTCCACGTCCACC^TCTTCCGCACC^^ 

>MRNLGVAASTSTLFRTIGGAVGASATVSLF 
20596 CCGTGCGGGTGCAGTCGGCGCTGGCCGATCGGGGGGTCGCCGACGTGGCTGACCTCCTCGGCCACTCCGCGCGGCTGGACGCCGC CGGGCTG 

>S VRVQSAL.ADRGVADVADLLGHSARLDAAGL 
2 0 6 8 8 GCCCAACTCCCCCGGGCCGTCCGTGTCCACTTCATGCACGCGGTGGCCTCCGGCACCCGGT^GCCTTCCTGATGACCGTGCTGGCGGGGCT 

>AQLFRAVRVHFMHAVASGTRWAFLMTVLAGL 
20780 GATCTGCGTCGCGGCGGCGTGGTTCCTGCGCCGX3GTCACCCCGTTGACGTCGGCACCGGTGGCACCCGAACCGGCGCGCGACGTCGCCGCGC 

>ICVAAAWFLRRVTPLTSAPVAPEPARDVAA 
20872 CCGCCGCCAGCAGCGGGCGCGCGCCGAACTACT AGCGGATTTCCTAGGGTTCCTCGTCGACGGTAGAGCTGAATTC ACCGGCGACCT AAC A 

>P AASSGRAPNY • 
2 0963 TTCTTTTCGCGATCCGGAATCCGTCCATTCCCCTGTCTGGCGATGGTCGACGGGCCGGCCCGTGCCGAGCGGACAGACACA 
21055 TGGAGCTCGATGTCCAGCAAGATCCTAGTCATCGGTGGAGGTCCGGCCGGATCC ACGGCCGCCGCGCTGCTCGCCCGATCGGGGCTGTCG 

>MSSKILVI GGGPAGSTAAALLARSGLS 
2114 5 GTGACGCTCCTGGAAAAGGAGACGTTCCCGCGATACCACATCGGCGAGTCGATCGCGTCCTCGTGCCGCACCATCGTCGATTTCGTGGGCGC 

>VTLLEKETFPRYHIGESIASSCRTIVDFVGA 
212 37 TCTCGACGAGGTCGACTCGCGGGGCTACCCGCAGAAGAACGGGGTCCTGCTC 

>LDEVDSRGYPQKNGVLLRWGNEDWA1DWAK 
21329 TCTTCGGTCCGGGCGTGCGGTCCTGGCAGGTCGACCGGGACGACTTCGACCACGTCCTGCTCAACAACGCCGGCAAGCAGGGCGCCAAGATC 

>I FGPGVRSWQVDRDDFDHVLLNNAGKQGAK I 
21421 ATCCAGGGCGCGGCTGTCAAGCGGGTGTTGTTCGACGGTGAGCGGGCCACCGCCGCCGAGTGGTTCGACCCCGAGTCGGGTGAGGTCCGCAC 

>IQGAAVKRVLFDGERATAAEWFDPESGEVRT 
21513 CATCGATTTCG ACTACGTGGTCGACGCGTCCGGCCGGGCCGGGCTGATCCCGTCCCAGCACTTCAAGCACCGGCGCCCCACCGAGACGTTC A 

> IDFDYVVDASGRAGLI PSQHFKHRRPTETF 
21605 AGAACGTGGCCATCTX3G*K3CTACTGGCAGGGTGGCTCGC^ 

>KNVAIWGYWQGGSLLPNSPSGGINVISAPDG 
21697 TGGT ACTGGGTC ATTCC GCTGCGCGGCG ACCGGTAC AGC ATCGGCTTCGTCTGCC ACC AG AGC CGCTTC CTGG AGCGGCG C AAGG AGC ACGC 

>WYWVI PLRGDRYS IGFVCHQSRFLERRKEHA 
217 8 9 CTCGCTGGAGGACATGCTCGCCGCACTGGTACAGGAGTCC^ 

>SLEDMLAAL\ QESPTVRGLTANGTYQPGVR 
21681 TGG AGCAGG ACTTCTC GT AC ATC TC CG AC AGCTTCTGCGGGC CCGGCTACTTCGCGGC CGGCGACTCCGCCTGCTTC CTGG ACCC ACTG CTG 

>V EQDFSYI SDSFCGPGYFAAGDSACFLDPLL 
21973 TCCACCGGCGTGCACCTCGCCCTCTACAGCGGCATGCTCGCCTCGGCGTCCATCCTGGCCACCATCCACGGTGACGTCACCGAGGAGGAGGC 

>STGVHLALYSGMLASAS I LATI HGDVTEEEA 
22065 GCGGGCGTTCTACGAGTCCC^CTACCGCAACGCCTACCAGCGCC^ 

>RAFYESLYRNAYQRLFTLVAGVYQQQAGKR 
2 2 1 57 CATACTTCGGCCTGGCCGACGCGCTGGTGCACGACAGCGGCGAACCCGAGT ACGAG AAGGTAGACGGGGCCCGCGCCTTCGCCCAGCTCGTC 

>A YFGLADALVHDSGEPEY EKVDGARAFAQLV 
22249 GCCGGCCTCGCCGACCIWACGACGCGGCGGA 

>AGLADLDDAAEGRHDSTAAAAPAEQDNSVRQ 
22341 GCTCTTCCTGGCCGCCGAGGAGGCCCGCCGGATGG^ 

>LFLAAEEARRMADARTPSAPVSEAPGKLDS 
22433 ACGACCTCTTCGACTCGGCAACCGGCCTCTACCTGGTCA 

>H DLFDSATGL.YLVTTPRLG I RRAKPADTQAA 
22525 GCAGAGCAGTCTGCCTGAGGTTCCACCCCTC^TGGCCCCC^ 

> A E Q S A • 

22616 GCATCCGGTGCCGGCGGCTGAGCAGGGGCAGCGCCACCGACTCCGGCCCGT^ 
22708 GTTGCTCGCCCTCG^CAGCGTGGCCGCCGGAGGCACCGTCATGTCACGCTGCCT^ 

>MS RS LR RDAQAAQAAPA 
22798 GTCGCCCGCC AACCCGC ACGCCGGGCACGCCGCCCCGGTGCCCAGCCGGGTC AGCACGACCACG<;TCGCGGTCACCCCGTTCACCGAGCCGA 
>SPANPHAGHAAPVPSRVSTTTVAVT PFTEP 

22890 tgcccgtcccgccgcc^crrc^ccccggtctcccgccg^ 

>mpvpprltpvsrrdgidvyei pirpaqvqi l 
22982 cccggccikxrtcacgcccgcctacacctacgccggttc 

>pglltpaytyagsfvgptirartgrpvrity 
2 3074 caccaacgggctcgacacccacgccaacgtgcacctgcacggcgggcacgtc 

>tngldthanvhlhgghvpatsdghpmdli p 
23166 cgggcggctcgaaggtctaosactacccgaaccitcagcgcg^ 

>pggskvydypnlqrgatlwyhdhthayeadh 
23258 gtctaccgcggactgcacggcttctatctgatcgacgacccggccgag^ 

>vyrglhgfyl i ddpaehhlrlpagk ydvp i m 
2 3350 gcrccgcaacgcccagttcgacgactccggcgccctcgtcttcggcc acccggacgaccgggtc accatcctggcgaacggcaaggcccagc 

>lrnaqfddsgalvfghpddrvti langkaq 
23442 cctacttcgaggtggcccccx:gcaggtaccggttcc^ 
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>PYFEVAPRRYRFRLLNAALKHVFRLNLGGEP 
23534 CTCACCCGCATCGCCACGGACGGCGGGCTC 

>LTRIATDGGLLPAFTSHTELALSSGERVEly 
23626 G ATCG ACTTCGC CGAGC ACGC AGGCGGCGGGCCGG T CT ACCTCT ACG ACGGGGACAACCCGATCCTG CGCTTCG ACGTGTC GTC C CGGGCGG 

> IDFAEHAGGGPVYLYDGDNPILRFDVS SRA 
23718 TCACCGACCCC AGCCGGGTGCCGGTCACCCTGCGCGCACTGCCCCCG ATGGGCACGCCGACCGTGGAGCGCACCGTGTCGATG AGCTTCGAC 

>VTDPSRVPVTLRALPPMGTPTVERTVSMSFD 
23810 ATGTCGGCCCGGCCCCCGATCGCGCTCATGGACGGCA 

>MSARPPIALMDGKPFDPLRVDVQVKRGSTEI 
23902 CTGGAACGTGGTC AACGCG^TACCGATCCGTTCCCCTTCGACCATCCGTTCCACCTGCACCTG^ 

>WNVVNADTDPFPFDHPFHLHLVTFRVLGRD 
23 994 GCGGGCCGCCCGCGCCGGAGGACGCCGGGCTCAAGGACACCGTCTACGTCTCGCCCAAGGGGTCTGTCAAGATCC 

>GGPPAPEDAGLKDTVYVS PKGSVKI QVTFAT 
24086 CCGTACCTCGGGCAGTACGTCTACCACTGCCACTACCTGGAGCACTCGTC^ 

>PYLGQYVYHCHY LEHSSLGMMAQLEVVP • 
24177 TCAGCCGTGCAGGTCG ACG ATC G AGGGGTGGGCGC CG AAC AGGCTGAC CGGCCGCACGTCGCCC ACCCCG AAC C CGG CGG CG C GGG C C AGG 

<• GHLDV I S PHAGFLSVPRVDGVGFGAARAL 
242 68 TCGGCCTGGTCGGCGAACTCGTGCAGCAGCAGTACCGCCCGCCCGCCGTCGACCGTC ACCCGGCGCAGCTCCGCGAAGAGGCGGCCGGAATC 

<DAQDAFEHLLLVARGGDVTVRRLEAFLRGSD 
24360 ACCGGCGAGCACGCCCCGCGCCTGCACCTGCCGGTCCCAGGGCGGATTGCTGACCACCCGGTCCACCCGACCGGTCCGCAGCGGCAA TCGTC 

<GALVGRAQVQRDWP PNSVVRDVRGTRL PLRG 
244 52 CGGCGTCGGCGACCGCCCAGGTGACGCGGGCCCCCGACGCCGCCGAGTTGGCGACGGCCGCGCCGACCGTCTCCGGGTCGTGGTCCGAGCCG 

<ADAVAWTVRAGSAASNAVAAGVTEPDHDSG 
24544 AACAGCACCGCCCCCGGTGCCAGCCCGGCTGCCTCCACGGGGATCGTGCCGGTGCCGCAGCACGGATCGGCCACC AGC ATCCCGGGGCGGAT 

<F LVAGPALGAAEVP ITGTGCCPDAVLMGPRI 
2463 6 GCCGGCCAGCCAGGCCAGCGCCGCGGCGAGCGGCGG ATGC AGGGTCCCCGGCGTGGACGACCGCTTGTAGGCCCGCCGGTGC AGCGGCCGGT 

<GALWALAAALPPHLTGPTSSRKYARRHLPRD 
24728 CGGCCACCCGTACCGCCAGCGTCGCCTGGGTGCCCTCGACGGTGACCCGCAGCGACAGCCCGCCCTCCGGCGGGGCCTCGCCGCCGCGGCGG 

<AVRVALTAQTGEVTVRLSLrGGEPPAEGGRR 
24820 GAGTGGTAGCGCAACCCGAGCGCGGCCACCGCGTGCCGCCCCACGGCGTCCTCGATGTCGTACCGGTTGTAGTTGCGGCGGCCGAGGAAGGA 

<SHYRLGLAAVAHRGVADEIDYRNYNRRGLFS 
24912 GGCGGCGACGTCCACGGTCGCCGGCCGCCCGGGCACGCCGCAGGCGGCGCGGGCCGGC AGCACCGCCGGCAGCGCCGCCGCCCGGGCCAGCC 

<AAVDVTAPRGPVGCAARAPLVAPLAAARALR 
2 5004 GGGTGAAGGCCGCCAGGTCCGCCTTGGTGTGGCCGACGCCGTCGGCGACGGCGACGAGCAGGAACAGGTCGTCGACGGTACGCAGATCCAGC 

<TFAALDAKTHGVGDAVAVLLFLDDVTRLDL 
25096 AGGCGCGGCTCCGCGCTGGCGGCGGAGAACCACACCTCGCGGTGCCGGCGGTGCTCGACCCGGCCGAGGCCCCGCTCCTCGATCTCCTGGGC 

<LRPEASAASFWVERHRRHEVRGLGREE I E Q A 
25188 GGCC ACCTCCTCCAGCCCTCGCAGGGTCCGTGCC ATGAAGCGCACCGTCACCGTCCGCCGATCCTCTCCCCGCCGGGTCGTCCCGCCGCC 

<AVEELGRLTRAM 
2 527 8 GTGCCGGCGGCGTCCGCGAGCCTACCCACCGCACCCGCCGCCGGGACGCAGCCGACGCCGACCGGACCCGCCGACGGGGGAGCCGACGCCGA 
25370 CCGGACCCGCCAAGACTAGGTGAACCTCTATAGGAATTCGCGTGCCCCCTTC ATAGGGTCCG AAAGGGGTAATGG AACCGTCCGGCACCGGA 
25462 CGGCTCGTTTTCTTCCCCCAATTCCGTCCGTCCGACC^ 

> M 

2 5553 CCGCACGGGCCCGTGCGCCGAAATCGTGGAGATTGCGCAGTGCGTACACCGG ATCTGTTC ATCGGCGCCGTCGGCGCCTTCGTCCCGCCGAC 

>PHGPVRRNRGDCAVRTPD^F IGAVGAFVP PT 
2564 5 GGTGAGCGTCGAGTGGGCGATCGACCGCGGTCTTT ACTCCCGCGAGCAGGTGGAGCTGCACGAGCTGGCGGGC ACGGCC ATCGCCGGCG ACC 

>VSVEWAIDRG T *YSREQVELHELAGTAIAGD 
25737 TGCCCGCGCCGGAGATGGCGCTGCGCGCCGJCCAACAGGCGGTCAAGC 

>L PAPEMALRAAQQAVKRWGGSPTEFDLLLYA 
25829 AGCACCTGGCACCAGGGGCCCGACGGCTGGCCGCCGCACTCCTATCTCC AGCGGCACCTGGTCGGCGGCGACCTGCTGGCGTTGG AGATCCG 

>STWHQGPDGWPPHSYLQRHLVGGDLLALEIR 
25921 GCAGGGCTGCAACGGGATGTTCAGCGCGTTCGAGCTGGCCGCC AGCC ACCTCCAGGCGGT ACCCG AGCGCACCAGCGCCCTGCTGGTCGCCG 

>QGCNGMFSAFELAASHLQAVPERTSALLVA 
26013 CCGACAACTACGGCACCCCGATGG TCGACCGCTGGCGG 

>A DNYGTPMVDRWRMGPGF IGGDAGSAL I LTK 
26105 CGACCCGGCTTCGCGCGGCTCCGCTCGGTCTGCAC^ 

>RPGFARLRSVCTKSVPEAERLHRGDEPLFPP 
26197 GAGCGTCCTCACCGGCCGGGAGCTGAACTTCACCGCCCGGATCG 

>SVLTGRELNFTAR I DQQFAARSPAS IAMAD 
26289 TCGGCGACCACATCGAGGAGGTCGTGGGGCGCGCCCTCGCCGAGGC 

>VGDHI EEVVGRALAEAEI EVGDLARVAFMNF 
26381 TCCCGGGAGATCATGGAGCAGCGCTGCCTGGCC AACTGGGGCCTGCCC ATGAGCCGG1^CACCTTC<3ACTTCGGTCGCCGGATCGGGCACTG 

>SREIMEQRCLANWGLPMSRSTFDFGRRIGHC 
26473 CGGGGCGAGCGACCCCTTGCTGGCCCTGGAACACCTGGC^ 

>GASDPLLALEHLARTGGLGPGDHLLTLGTA 
26565 CGGGCGTGGTGGTG^TCGTGCGCGATCGTCCAGGTGATC 

>PGVVVSCAIVQVI ESPTWRE* 
26656 GCCCAGCAAACCGACAGCAGGGGATGATTGTGGAAGCAGAGAAGGACCGGTTGCGTCCGGTGGCGTCC 

>VEAEKDRLRPVAS E A V A V V G 
2674 6 TCGGCTGCCGGTTCCCGGGCGACGTC AACTCGCCCGACGAGTTCTGGGACCTGCTCACCGGGGGTCGC AAC ACCACCGGGACGGTGCCCGAG 

>IGCRFPGDVNSPDEFWDLLTGGRNTTGTVPE 
26838 GAGCGCTGGAGCGCGTACCGCGACCTGGGTCCGGCGTTCGAGTCCGCGCTCCGG 

>ERWSAYRDLGPAFESALRSATRAGNFLADIS 
26930 CGGCTTCGACGCGGACTTCTTCGGCATCTCCCCGCGCGAGX3CCGAGCnXlATGGACCCGCAGCAGCGG 

>GFDADFFGI SPREAELMDPQQRLMLEVTWQ 
27022 CGCTGGAGGACGCCGXjGATCCCGCCCCGCACCCTG^ 

>A LEDAGI PPRTLAGTDVGVFAGVCTYDYGGH 
27114 CAGTTGGAGX3ACCTGCCGCACATCGACGC<ri<K3ACGG<SCATCGGCGCCGCCACCTGCGCCGTCGCCAA 

>QLEDLPH IDAWTG I GAATCAVANRVSHVLDL 
27206 GCGCGGGCCGAGCC*TCTCGATCGACACCGCCTGCTCGGCGTCGC^ 

>RGPSLSIDTACSASLVALHLAAQSLRLGES 
27298 CGCTGGCCCTCGCCGGCGGGGTCAACCTGATCGTCAC^ 
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>TLALAGGVNLIVTPGQSITLGSAGALAPDGR 
27390 AGCAAGTCCTTCGACGCCACCGCCGACGGCTACGGTCGTGGCGAGGGGTG^ 

>SKSFDATADGYGRGEGCGVLVLKLLSDAQRD 
274 82 CGGGGACCGGGTGCTGGCCGTGCTGCGGGGCAGCGCCGTC AACCAGGACGGCCGGACCAACGGGATCATGGCACCGTGCGGC C AGGCCC AGG 

>GDRVLAVLRGSAVNQDGRTNGIMAPCGCAQ 
27574 AGCACGTGATGX3TCCGCG<:CCTGCGCTCG<5CCGG 

>E HVMVRALRSAGI EAGSVDYIEAHGTGTPLG 
27 666 GACCCGATGGAGGCCGCGGCGATCGGCTCGGTCTACGGGC AGGACCGCCCGGACGACGAGCCCTGCCTGATCGGTTCGGTC AAGTCC AACAT 

>DPMEAAAIGSVYGQDRPDDEPCLIGSVKSNI 
277 5 8 CGGCCACCTGGAGGGCGCGGCCGGCGTCGCAGGCGTCATC AAGGCGGTCCTGGCGCTGAACCGGGCCGAGGTGCCCGCC ACCCTGCTGGTC A 

>GHLEGAAGVAGVIKAVLALNRAEVPATLLV 
2 7 B 5 0 CCGAGGTC AACCCGG AC ATCGAGTGGAAGCGGCTGCGGCTGCGCCTGGTCACCCGCAACCAGCCCTGGCCGGACCGGCCGGGGCCGCGCCGC 

>T EVNPDI EWKRLRLRLVTRNQPWPDRPGPRR 
27 94 2 GCCGG AGTCTCCGGCTTCGGCTACGGCGGC AC CG TGGCGC ACGTGGTGCTGG AACAGGCCCCGCCGGTC GC CGC CG AGC CGG C C CCGG CGCT 

>AGVSGFGYGGTVAHVVLEQAPPVAA EPAPA L 
2 80 34 GACCGGCGAG ACGCTGTTCCCGATCTCCGCCGGCTCCGCGC ACTCCCTTCGCGAGCGGGCCCGCGCCCTGGCCGGG ATCGTCCCGGATGTCG 

>TGETLFPISAGSAHSLRERARALAG IVPDV 
28126 ACCTCGCCGCGCTCGGGCACACCCTGGCTCGGCGGCGTTCGCACCTGACCCACCGGGCGGTGGCGGTGGCCGCCGGCCGGGACGACCTGGTC 

>D LAALGHTLARRRSHLTHRAVAVAAGRDDLV 
28218 GCGGCGTTCGCGGCGCTCGCCGACGAC AGGCCGC ACGACCGGGTGCGTACCGGAAGCCCGGTGGCGGAGCCGCCCCGCACGGTGTGGGTGTT 

>AAFAALADDRPHDRVRTGSPVAEPPRTVWVF 
28310 CTCCGGGCACGGGTCGC AGTGGACGGGCATGGGGCGGGAACTGCTGGCCACGGAGCCGGCCTTCGCGGACGCGATCGACCGC ATCGAGCAGA 

> SGHGSQWTGMGRELLATEPAFADAI DR I E Q 
28402 TCTTCCTCGACGAGATCGGTTTCTCACCCCGCC AGGCGATCCTCGACGGCGACTACGAGGCCGTCGACCGGACCCAGAC AATGATCTTCGCG 

>IFLDEIGFSPRQAI LDGDYEAVDRTQTMI FA 
28494 ATGCAGCTCGGCCTGGCCGAGATGTGGCGAGCGAGGGGAGTCGAGCCGGACGCGGTGATCGGCCACTCGGTCGGAGAGATCGCCGCGGCGGT 

>MQLGLAEMWRARGVEPDAVIGHSVGEIAAAV 
28586 G AC CGCCGGC ATCCTG AC CGTGGCC GACGGCG C ACGGCTG ATCTGCCGTCGTTCCCTGCTGCTGCGC G AGGTCGCC GGC C AGGGCGCG ATGG 

>TAG I LTVADGARL I CRRSLLLREVAGQGAM 
28678 CCCTGGTGACGCTGCCCTTCGAGGAGGTCGCGGCC AGGCTGGCCGGCCGCGTCGACGTGGTCGCCGCGATCGCCTCCTCCCCCTCGTCGACC 

>A LVTLPFEEVAARLAGRVDVVAA IASS PS ST 
28770 GTGGTCTCCGGCGACCCGGCCGCGCTGGACGCGCTGGTCGCCGAGTGGACCGA 

>VVSGDPAALDALVAEWTEEGLGVRRVASDVA 
28862 CTTCCAC AGCCCGCACATGGATCCGCTGCTCGACCGGCTGCGCGCCGCCGTCGACTTCACCGCCCGCGC ACCCCGGGTGCCGATCTACAGCA 

> FHSPHMDPLLDRLRAAVDFTARAPRVP I YS 
28954 CGGCGCTGGCCG ACCCGCGGGCCCCGATCACCGCCGACGGCGAGTACTGGGCCGCGAATCTGCGCAACCCGGTCCGGCTCGCCGC AGCGGTG 

>T ALADPRAPITADGEYWAANLRNPVRLAAAV 

2 9046 GCCGCCGCCGTCTCCGACGGACACCGGGCCTTCATCGAGGTCTCCCCGCACCCGGTGGTGACCCACTCGATCCACGAGACGCTGGCCGGAAG 

>AAAV S DG H RAF I EVSPHPVVTHS IHETLAGS 
29138 CCTCGACGACGAGGTCTTCGTCGGCGGCACCCTCCGCCGCGACAC^ 

>LDDEVFVGGTLRRDTPEAQAFLSSLGAAHC 
29230 ACGGGGTCGCGGTCGACTGGGGCCGGGTGCATCCGTCCGG 

>H GVAVDWGRVHPSGPLVTLPGYPWRHRSHWH 
29322 TGGCCGACGCCGGCCGCCGCCACGGGCCGGGGCCACGACCCCGCGTCGCAC ACCCTGCTCGGCGCGGTCGACAACGTGGCGGGCAGCG ACGT 

>WPTPAAATGRGHDPASHTLLGAVDNVAGSDV 
29414 GCGGGTGTGGCGCACCGCACTCGACGACGCCAGCCGCCCGTACCCGGGCAGCCACGCCCTC^ 

>RVWRTALDDASRPYPGSHALNGVEIVPAAV 
29506 TGGTGGAGACCCTCATGGCTGCCGCCGGGCGGGGCG ACGGCCGGCCGCTGCTG ACCGGCTTGTCG ATGCGGTACCCGCTGATG ACCGCCGGG 

>L. VETL.MAAAGRGDGRPLLTGLSMRYPLMTAG 
29598 CTGCACGAGGTCC AGGTGGTGCGGX3ACGGT<3CCGAGGTGCGGCTGGCGTCCCGTTCCGTCG^ 

>LHEVQVVRDGAEVRLASRSVDAEADPSRDWL 
29690 GATCCACACCGACGCCACGGTGGCCGACGCCGACGCCACG<nX3CTCGCCGC^^ 

> IHTDATVADADATVLAARALADPDDHRMEP 
29782 GCXSACCCGGGCTCCATCCACCGCCGGCTCGCCGAGGT^ 

>G DPGS I HRRLAEVGVPSTGFDWSVEEL.LSGY 
2987 4 GGCGTGCTCCGOKXSCGGGTGCGCTCGGCCGACTCGTXTCAC^ 

>GVLRARVRSADSSTWAPVLDAVMSVAPAVFP 
299 66 CGGCGTGKTCGCAG<rTACGCATG<rrGGTGTACGTCGACG AGGTGCTGCTCAC CGGAGGTGACGCTG ATCGAGGTC GCCCTCG 

>GVPQLRMVVYVDEVLLTGEPPEVTLI EVAL 
30058 ACCCAGACCGG^CCGACACGGCGAACGCGCTGGTCGCGGATGCTC AGGGCCG^ 

>DPDRPDTANALVADACGRVVASLPGLRYPVI 
30150 GACCAGCCGGTCGCCCCGGCGCAGGACAGTTCCGGCGAGGTGGAGGAGGCGGTCTCCTTCGCCGGCCTC 

>DQPVAPAQDSSGEVEEAVSFAGLSDEELHER 
30242 GGTGTTCGACGAGGTGCGCCX3GCAGATCGCCGXXXSAGATGCGACTC 

>VFDEVRRQIAGEMRLDADDLHPRRPLAEQG 
30334 TCGACTCGGTGATGACGGTGGTGATCCGGCGACGCCTGGAGAAGCGCACCGOT 

>LDSVMTVVIRRRLEKRTGRSLSPTVFWQRPT 

3 042 6 GTCGCCGCCATCGCCGACCACCI<5GTGGAGCrcTTGAGCArc 

>VAA I ADHLVELLST PQE • 
30517 CGGGCCCTCC(TTGCrrGTCGACGGATCTTCAG<5TG<3CGGGGTCAGCCCG 

<• GPRQDVAPTAAWELHR 

3 0608 GCTCTCCCGCAGGGTCTCCTCGGCGTTCTGTTCGA 

<SERLTEEANQEVRRMAVRLMPPAMVSTVVAV 
3 0700 CCAGCACCACGATGGTGTATGAGGCGGTGTTCAGCACGCCCAGGCGCAGCCCGACCATCG<^GATGATGATCTCCACGGCACCGCGCGCGT^XJ 

< LVV I TY SATNLVGLRLGVMAI X I EVAGRAN 
30792 AGCCCGGCGCCGAG<X;CGACGCCCTCCCAGTGGCTCTGCC 

<LGAGLAVGEWHSQRALRAGLYAGTYKGLVAL 
30884 CGCCAGGATCACCGCCCCGGCGGCGAGCACCTCCGGGTCGGCGAGGGCACGCAGATCGACCCGGAGCCCCGCGCTGGCCAGGAAGATCGGGG 

<ALIVAGAALVEPDALARLDVRLGASALFIPA 
30976 CGAGCACGGACAGCACCACCGTGCGCAGCGGTGCCAGC 

<LVSLVVTRLPALRAPEGNGPLGILVGAVLA 
31068 CC^GAAGATCGC CTCC AGGCCC AGCGCGTGCGCCCCCGCCGCGAAGGCCAGCACGATC^ 
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<GFIAELGLAHAGAAFALVIVVAVASAAGGDP 
31160 CTGGGCGTTCGCCCAGCGCATGGCGCGCCGGGTCACCGGCCGGCCGACCAGGACGGCCACCGCGAGGTAGGCCACGAGGT AAAGCAGGGCGG 

<QANAWRMARRTV PRGVLVAVALYAV LY LLAT 
31252 TCACCACCH^CCCGGCG^TGAGG^TGCTCACCGCCACCGACGAGATC 

<VVQGATLTSVAVSSILSLLFWAAADELSAA 
31344 GCCAGGATGATCTGCCCCACGTCGCGGTGCAGCAGGCGCATCTCGX3TC 

<A L I I QGVDRHLLRMDTLTKAIVPVASVAMAV 
31436 GCCGACGAACAGCGCGAAGACAGTCCGCTCCTGCCCGGCCGCGAGAAGCGCCGCCGGGGCCAGCAGGCCGGCGGCG ATGCCCAGGCCGAGGG 

<GVFLAFVTREQGAALLAAPALLGAAIGLGLP 
31528 GCACCGCGAGGCCGCCGACGGCC ACCGCGCCGATCGTGCCGGCCCGCCGCCGGACCAGCCGCAGGTCC AGGTGCAGACCGGCC ACGGCG ACC 

<VALGGVAVAG I TGARRRVLRLDLHLGAVAV 
31620 AGCAGTACGACGCCGAACTCACCGATGGCGTCGAGCAGGTGGACCTGGTC^ 

<L LVVGFQGIADLLHVQDPDAPLLWRGI D P A L 
31712 GGCCCCCAGCACCGAGX3GGCCGAGCAGCACCCCGGTCAGCAGCTCACCGACCACCGCCGGCAGGCCGAAGCGTTGCGCCACCCGTCCCAGGA 

<AGLVSPGLLVGTLLEGVVAPLGFRQAVRGLV 
31604 CGACGGCGAGCAGCAGCAGCAGGCCCACCTGGAGCAGGAAC AGC AGTAGCTGGTGGGAGCCCAGCGGGGGC ACCGGCGCGGCC ACGATCA 

<VALLLLLGVQLLFLLLQHSGLPPV 
31894 CGGTCGTTGTTCCTTTCGTCTCGACGCCCGGCCGGGCCCGCCGGTGGTCAGGCCGCGATCTCGGGCGGCAGGTCCACCGGGTCGGCCGAGT 

<• A A I EPPLDVPDASN 
3198 5 TCATGAAGGTCCGCAGGGCCCGCAGGTCGGCGGCGGGCTTCTCCCAGCCCAGCCCGGCGCACCAATCGATGAGGGAGGTCAGCTCCC 

<MFTRLARLDAAP KEWGLGACWDI LSTLEGT 
32077 GCCCGAACCTTGTCCTCCTCGGGCAGGCTGAGCACCGACATCTCGGCCGGCCACTGCACCACGTTGGCCAGGTCCACGTCGAGCCCCTCGGC 

<ARVKDEEPLSLV£KEAPWQVVNALDVDLGEA 
32169 CCGGGCGAACTCC AGC AGGTCGCGCAGCCCCCAG ACGTTGTCCCGCTGGGGTGCCACCTGGAGCCAGAGGTTG ACCTCCGAGCGGGCCCGGC 

<RAFELLDRLGWVNDRQPAVQLWLNVESRARR 
32261 GGACGTTCGCGATGAAGGTCTCCCACTTCGCGCCCTGCCGGATCCGCTCG AACACCTCGCCGTAGCCGTCGCAGGAGGCGCCGATGCCGATG 

< V N A I FTEWKAGQR I REFVEGYGDCSAG IG I 
32353 CTCTTGAAGTGCCGGAACCGGTCGAAGACCGACTCCGGCAACACGGTGAG^ 

<S KFHRFRDFVSEPLVTLNSNYVVDVNGANGT 
32445 CTCCACCAGCAGCTCGAGCAGGGCGAAGTGGCCCGGCTGCATGAACGGCTCCCCACC^ 

<EVLLDLLAFHGPQMFPEGGAFYLRRILHANE 
32537 CGCGCAGGGTCTGCC ACAGCTCGTCGTCGTCCCGGTAGGCGTCGATG ACCGCGG ACGACCAGGCCGGCCGCTTCTTGGCGCCCCAGCCCGAG 

<RLTQWLEDDDRYADIVASSWAPRKKAGWGS 
3262 9 CTGACCGGGTACGCGC AC ATC ACGC ACCGCAGGTTGC AGGTGTTGCCG AACCGG ATGTCG AGG AAGAACGGG AAGTC CTCG ACGGTGCCGTC 

<SVPYACMVCRLNCTNGFRIDLFFPFDEVTGD 
32721 GGGCGCGGTGCGGGCGGCCAGCCGGTCGX3GGTCGGCGATGTCCCGGAACCGCTGGTTGATCTCCTGCCGGTACGACAGCGCGCCGTGGTCCT 

<PATRAALRDPDAI DRFRC/N I EQRYS LAGHDE 
32813 CGCGGTGGTAGCAGTAGGAGCAGGCGTCGACCCGCTCCCCGGCCAGCATCGCC AGCCGGGTCCGGCGCATGTTGGGGCTGTTG AAGGCGTC 

<RHYCYSCADVREGALMALRTRRM 
32904 CGCCAGGCCCATCACCCGGCCGGGGTTGTCCGCCGCGTAGCGGG ACCGCGGCG AGC AGCCGACGGCGTCGTCGTTG AGC AGG AACTCCGGCT 
3 2996 CCTCCTCCTGCTCGTACAGCTTGTTGTGGTACATCGAGTCGTCC ACGCAGCACCGCCCGTAGACACCGTCG ATGG ACGCGCAGAGATGGATC 

33088 cagggcac<:acgcacatggtccgggaccgggcttccgggtcagtcatgaaagttgatcacctcggtggt^ 

< • G P R 

33179 gccgaccgtctcgacccggggcagcggaaagatcaaccgggtgccgctggcc AGCATCTCCGCCTCCCGGGCGACG ATCTCGTCCCGGAAGT 

<GVTEVRPLPFILRTGSALMEAERAVIEDRFH 
3 3271 GCCAGGGCAGGACCAGGTAGTAGTCCGGGCCX^CCGCCCGCGACTCC 

<WPLVLYYDPRAARSEQESI I EIDTGLTRAG 
3 3363 ACCTTGTCCGGATTGCGCTCCGCGGCGTACCGGATGAGCTCGCGGTCTATGCCGC AGAACTGCAGGAGGGTGTTGCCCTTCGTCGACGCGCC 

<V KDPNREAAYRI L.ERDIGCFQLLTNGKTSAG 
3 3 455 GTAGACGTGCACCGTGCGGCCCTGGCCCCGCAGCTCC^GCAGCAGGGCGCT^ 

<YVHVTRGQGRLERLLASVEDRHQ.RVQEAFRG/ 
33547 GGTACGGGGCGTCGCCGTCCAGCCCCAGCGCCAGCTCCCGGTC^ 

<YPADGDLGLALERDALAKVSGDARGGVEGA 
33639 CGGXjTGACCACGCAGCAGATCGAGCCGCCGTTCACCCCGTTGAGGCTC*^^ 

<R TVVCCISGGNVGNLSARVI ELGAAGLIRSL 
33731 CGTGGCCAGCGAGTAGTAGCLACAGGTGCTCGT\3GCAGAT^ 

<TALrSYYSLHEHC I SDYGAI ELMAPLYAVEVV 
33823 CCCAGACCCCGCCCGG<^GAGCAGC<3CCrTCGACCTGACG<3G<X;AAC^ 

<WVGGPALLAEVQRAFEVPDEVDYFMAI STV 
33915 AGGTCGAAGCTGCCCGCGTG^GGGACCAGCTCGGGGCTGGGGAAGAAGTC^ 

<L DFSGAHPVLEPSPFFDRILNFDDPADDAAS 
34007 GGAGGGGTCGATGCCCCACCGCTGCGCGTCC&TCAGGTTG^ 

<SPDIGWRQADTLNGLLTGDNCGIDLVKGPRE 
34099 CCCCCAACACCTCGACCGCC^CGTCCACGACGTCGGCX^AGATC 

<GLVEVAADVVDALHRRMTDNI RSRYWYTDY 
34191 AGCAGCCCACCCGGCAGGGTGTGCCGCAACTGCACCAGGCCGCACGGGTC 

<L LGGPLTHRLQVLGCPDGGDREACRTLELPF 
34283 GCGCACCCTCKXXXXMTCCGACACGCCGGGCTTCACGAAGC^ 

<RVRPPDSVGPKVFSGQLYQAGLDLVTRLTGG 
34375 CACACACCCGXXAGK5TGX3TCCGCTCGACCACCTCCGACGCCAACTCCTCGCCGGCCGCC 

<CVRCTTREVV < • T R R T 

34464 CTCGTGCCGGTGTGKTGGGACCAGGCGCCCCGTCGTGCGG^ 

<EHRHAPGPAGDHPKSAVLLIDLYFPQDPGES 
34 556 TGCGGCCGAGATGCCGGAACGCCCGGTCGAGGTACTCGTCGAGCGCGCGGGGCCGCAGCCGGTCCACC AGCCACAGGGCGCGCAGGGCC AG A 

<RGLHRFARDLYEDLARPRLRDVLWLARLAL 
3 4 648 CCCACCGGGCCGCCGGACGXX?CAGCCGTGCTCGCGGCCGTACCAGCGCAGCAGC^ 

<G VPGGS PWGHERGYWRLLLLLGRPGCTLKVT 
3474 0 GCGGTCGACGGTGAAGCCGGCCCACTCGGCCTGGCGGG^TCAGCCCGTCGGCG^ 

<RDVTFGAWEAQRALGDATWRWLDQGGHEEWV 
34832 CCCCGTGGGTGGAGAGCACCAGCCGGCCGCCCGGX^GCAGCIAGCCGGTAC^ 

<GHTSLVLRGGPRLLRYAERLYADADSVHEL 
34924 ACCTGGGTGGAGAGCACCCCGTCGAACGTCCCGTCCGGCACCOG^ 
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<VQTSLVGDFTGDPVPCRGDRDLAHDAPLSEG 
35016 GCCCGGG ATGTCGGCGGTCTGC AACTCGGCGG AGCGGAAC AGGCCCCGGT ATGGCGAGGTGCC CGCGCC GT AGTCG AGC C AG ACTC CCGTCG 

<GPIDATQLEASRFLGRYPSTGAGYDLWVGTA 
35108 CGTCCCGGACCGCCTCCGCCAGCGCGTCGCGCAGGTCCAGG AAGTGGGCGTATGCCCAGTCCCCGGGCCGGGGCTCGATCCGCTCCCGGAAC 

<DRVAEALADRLDLFHAYAWDGPRPEIRERF 
35200 CGTTCGGCCATCACCTCGTCCAGCGACGGGCCGGTGGGGGCGGTGCCGGCACCGGTCAGCAGGGGACC AAGCCCGGGTACGCGCCGCAGC 

<R E A M 
35290 AGGGTCACCAACGCCAGTACTCCTCGTGCGGGTACC 

<• WRWYEEHPYRMTTPEPLGEWRRLSAAPR 
353 81 GTACTCGAAGGGCTGCTCGCCGGCGTGCTCCGGTGCCGCCCGCCGCTGCGCCAGGCAGGAGAGC AGCACCTGCTGCTTGAGCCCGATGTACT 

<YEFPQEGAHEPAARRQALCSLLVQQKLGI Y E 
3 5473 CCGGCACGTGCGGGCCGAGGTCCAGCTCGACCTCGGCGGGCCGGTACTCGAAGTAGATGACCCGGCGCCGCTTGCCGGTC AC CGCCGGCGCG 

<PVHPGLDLEVEAPRYEFYIVRRRKGTVAPA 
3 5565 GCGTGCAGCATCAGGATGTTGTGCAGCATCACGTCGCCCGGGTTCATCACCGCCGGCACCGCCCCGGT^ 

<AHLMLINHLMVDGPNMVAPVAGTTDWETANM 
3 5657 CCGGGTGGTGGTCTCGTTCGCCCGGTCGGTGTCCCAGTAGTTCGACTGCGGGATGCACCAGACGCAGTTGTCCTCCGGGGCCGGGTCAAGGT 

<RTTTENARDTDWYNSQPICWVCND£PAPDLY 
357 4 9 AGATGCCGACGTCGATCACCCGGCCCGCGCCGGTG ATGCCG ACCGCGTTCTCCGGGTAGAGGCCGCCGTCGCGGTGCCAGGGCAGCCGGGGC 

< IGVDIVRGAGT 1GVANEPYLGGDRHWPLR P 
35841 GCCCCCGCCTCGGTCTTGAAGACCATGCTGTC CC AGGTGGGGATGAGGTTGGGGCCGACCAGGTCCTCCATCGCCCGCAGCAGCAGGGGGTG 

<A GAETKFVMSDWTP I LNPGVL.DEMARLLL PH 
359 3 3 GCCGGCGAGCCGGGCGACCACCGGGGACTTGTCGACCACGTACTCGATCCGCACCGGCGCCGCGTCCGGCTCGTGCGGTTCCAGCGTCCAGA 

<GALRAVVPSKDVVYE IRVPAADPEHPELTW I 
36025 TGGTGTCGGTCATCGTCCGGGTGCGCCAGGCTTCGTCGATCAGCTCGTCGGCCGCCGCCTGCACGGACCGCAGCTCGTCGGGGTCCAGCAGC 

<TDTMTRTRWAEDI LEDAAAQVSRLEDPDLL 
36117 CCGCGCAGGATCAGCGCGCCCTGCCGGCGGAAGGCGGTCAGGTGCTCCGGAAGC^ 

<GRLILAGQRRFATLHEPLLGTEHIHCEPVA<2 
3 6209 CTCGGTGCGGACGTCCACAGTCGCGCTCATGGTTCGGTTCCCTTCTGCCAGGCGGACGGTTCGTGCTGCCCGGAGCCGCCGGCCGGGCC 

<ETRVDVTASM 

< ♦ PETGKQWASPEHQGSGGAPGP 
36300 GGCTCGGTCGGTCGGCGACGAAGTACCAGTGCTCCCGCAGCGCGTCGGCGAACCCCGCCCGGTCCAGCGCCGGCTGCCCGGCGCGGCGCCCG 

<SPRDAVFYWHERLADAFGARDLAPQGARRG 
363 92 GGCAGGGTACGCAGCTCGTAACCCAGCTCGGTCACGAGCAGCGCCCACAGGTCGGCGCTGGTGGTGCCGTACTCCCGCATGGCGTGGTCGCC 

<FLTRLEYGLETVLLAWLDASTTGYERMAHDG 
3 64 84 GCCGTGCTCGAAGACGATCACCGGCCGCCAGCGGCGGAGCAGCTCCACGGCACCGCGCAGGGCGAGCACCTCGCCGCCCTCGGTGTCCACCT 

<GHEFVIVPRWRRLLEVAGRLALVEGGETDVK 
36576 TGACCAGGTCGATCCGGCGGTCACCGGGGAGCACGTCGTCCAGGCGGACGGTGTCG ACCGTC AGCTCCCGCAGGGTCTCGTCCGGGCGGTCG 

<VLDIRRDGPLVDDLRVTDVTLERLTEDPRD 
3 6668 TAGGGACGCCGGCGCAGCCCGCTGTAGCCGGGGTTGGAGACCACGT^ 

<YPRRRLGSYGPNSVVHVFSDRGTREAAAAAV 
36760 CACCGTCACGCCGGGGAAGTCCCGGCGCAGCCCCTCGGCGTACGACGGCAGCGCCTCGACGGCCACGTGCCGGCCACGGGGGGCGACCCGCA 

<VTVGPFDRRLGEAYSPLAEVAVHRGRPAVRL 
3 6852 GCAGGTGACGCAGGATGTCGCCGGCGCCGGCCCCGATGTCC ACGGTGTTGGCGTCCGGTTCGCAGATCTGCTCGATC AGCGCCACGGTGAGC 

<LHRLIDGAGAGI DVTNADPEC XQEI LAVTL 
36944 TGGTCGTACCAGTCGTTCATCGAC AGCGCGCCGGCCTCGTCGGTCGCGGAAAGCTC AGTGGAC ATCGTCACGCTCCTCGGTCCGGCACGCC 

<• HVDDREETRCA 

<Q D Y W D K M 
37035 GOTCCGGCCCCGGAGCCGACCGCCGG^CGGTACGGACCAGGAGTTCCAGCTCC^ 

<P GAGSGVAPATRVLLELERLEI ESLELGAAR 
37127 GACGTTCTCCTCGACGACTCCGGGCGACTGAGCGCCGAACACCGGCACCACCCCGGCCGGGTGGTGCAGCGCCCAGGCCAGTGCCACCTGCG 

<VNEEVVGPSQAGFVPVVGAPHHLAWALAVQA 
37219 CGACGGTGTGCCCACGCTCGKSCGGCGAAGGCGGCGAGGCCGTCGACC^ 

<VTHGREAAFAALGDVVDLLQAYDEGRFAHS 
37311 TAGGCCCGCCAGTCCTCGG<3GGCGAATGCCTGGTCGCG<rrGCAGCGCGCCGGTGAGC AGCCCGTGGGCG AGCGCGGAGCCGCCCAGCACCCC 

<Y ARWDEPAFAQDRHLAGTLLGHALASGGLVG 
37403 GACGCCGGCCTCCTGGCAGCGGGGCAGC ACCTCCTTCTCGGC ACC ACGGTCGAGCAGGTTGAACGGCACCTGG ACG ACGTCC AGC AGCCCGG 

<VGAEQCR P LVEKEAGRDLLNF PVQVVDLLGT 
37495 TCG^ACCAGCTCGGCCAGGTCGCCCGCCGTCACGTTGGCGA^ 

< PVLEALDGATVNAFGVHRALGERVFGALVE 
37587 GC^TCTCCGCGAGCGGGACGGTCGGGTCGGGGCAGTGCACGGAGTACACGTC^ 

<ATEALPVT PDPWHVSYVDVHDTGLQRL SALL 
37679 CTCGTCCtt3GAGGAACGCGGGGTCGCTGTTACGC^^ 

<EDRLFAPDSNRVTRGPPDLKHRRVGPRTELG 
37771 CGCCGGCCGTGGCGATCACGATC^CGTCCCGGT 

<GATAIVI EDRHAPLLDALGRALAAEAAGGG 
37863 TACGCCCGGGAGGTGTCGAAAAGGGTGACGCCCAGGTCGAAGGCCCGGCGGACGGCCTGCACGCC 

<Y ARSTDFLTVGLDFARRVAQVGPEI RRGWQG 
37955 GCCGAGCG^CCAGGTGCCCAG<5CCGAGCGCCGAGACCAGCGGCCCCCGCrCACCGATGCAGCGCTGCCGCACGGCGTCCT 

<GLAWTGLGLASVLPGREG ICRQRV 
38046 CCCCACCCACGCCCGTCGGCCGGCCGTCACGCCGCCGGCTCCCCGTCGCTGCGCAGCGTCAGGAACA 

<• AAPEGDSRLTLFVVGDHNIMK 
38137 TGTCGGTGACCGGCGTCAGCCCGGCCTCGGCGGCG ATGCCGGTC ATCAACTCCACGCCGTGCCAGTTC AGGAACCCCTTC AACTCCACGGGG 

<DTVPTLGAEAAIGTMLEVGHWNLFGKFEVP 
38229 TTCGCGTCGCGGTACCGCTCG<5CGTAGGTGTGGAAGAAGCCGCGCGTGTTG 

<N ADRYREAYTHFFGRTNDGLDLFNFCFLGGP 
38321 CCtSCAGGATCCGCCGGATC'rcACGGAAGT 

<RLIRRIQRFYLFVEFVNLHIFVNLSFGADFA 
3 8413 CGGCGGTCGGCAGCTTCTCCAGGAAGTCGTTCTCGATGTGGT^ 

<ATPLKELFDNEIHKYSVNERGECTARAKDL 
38505 AAGGATCGGCTG ACGTCGGCGCAGAGC ACGGCGCGCACCCGGTXZGGCGAGCCCGGCCGCCATGATGCCCTCGCCGCTGCCG ATCTCG AAGAT 

<FSRSVDACLVARVRDALGAAM 
38597 CTCCGATTCCGGGCCGAGCCCGAGCTGCTCGACGACCAGGGCGACC^^ 
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38689 CGAGCTGCATCTGCATCTCCTCCGKSCGTGTTC^ 

38781 TGCGGCGGCGGCGTCGCCTGCTTCGCGTCGTTCATCGGACCTCA 

38873 GAAAACTGGGCGGGCGGTGTCACGGACGCCGGGCGAACTCGGCGACGTCCT^ 

< • PRRAFEAVDEPSLAGTDALRALL 
389 64 TTCGGCC ACCTGGACGGCGGTCGGCCCGGTGGTG ACGG ACTCCCGCATCCGCTGCGCCGCC ACCCGG AATCGGTGGTCGTAG AGG ACGG A3 C 
<EAVQVATPGTTVSERMRQAAVRFRHDYLVSG 
3 9056 CGAGGGCCTCGTCGACCTCCTCGCGGGACGCCTTCAGGCCGGGCAGCGTCTTCGTCGCGCCCTGCGGGTCGAGCCGCCGCCCGTAGATCAGG 

< LAEDVEERSAKLG PL-TKTAGQPDLRRGY I L 
3 9148 GCGTCGTAGTTGAGCGCCAGCGACAACTGCGGCACGCCCATGX^GAGCCCGTTCATG 

<A DYNLALSLQPVGMALGNMYCNASGHHVLLD 
3 9240 GCAGTCGGGGAGGATGAGCTCCAGCGGGCAGTTGCTGAGC ACCCGCACGTTCGGCGGCAGCGCGCCC AGCCCCTCC ACCTCGG AGGAGGCGG 

<CDPLILELPCNSLVRVNPPLAGLGEVESSAA 
3 9332 CGGTGATCACGACCTCCACGCCCCGCTGGGCGGCGGCGTCG ACGGCGTGCCGCAGCGCCGGCACCTGCGCGCCG AAC ACGCCCGTGGCGGAG 

<TIVVEVGRQAAADVAHRLAPVQAGFVGTAS 
3 9424 TTGCCCCACACCACGCAGACCCGCTTGCCCCGGCGCGGACCGAGCAGCCAGGGGTCCACGTC 

<NGWVVCVRKGRRPGLLWPDVDQSGNYPQYRI 
3 9516 CGGGATCCGCAGCGCGTCGCCCATCGGCGGGATCGCCACGTCGGGCGACGGGTCGATGGCGTACCGGATCTGGTGCCGGCTCCACTCGACGC 

<PIRLADGMPPIAVDPSPDIAYRIQHRSWEVG 
3 9608 CGTACTTGCGGAACTCGGTCACCGGGTCGCCGGAGACC AGGTCG AGCCCGGGCTCGGTCTCGATGGTGCCGATGAACCCGGGCGAGAAGTAG 

<YKRFETVPDGSVLDLGPETEITGIFGPSFY 
3 97 00 ACGCTGGGGATGTGGTGCAGCTCGGCGACCAGCGCGCCCTCCACGGCCATGATGTCGTGGACCACCAGGTCGGGCCGGTAGTGGGCGGCGTA 

<V SPIHHLEAVLAGEVAMIDHVVLDPRYHAAY 
3 97 92 GTCGACCGCGTTGTCGTAGCTGCGCIX^ACCGCGGTGACCGTCCGCT^^ 

<DVANDYSRQVATVTRKWYDALLDTDFDALSD 
3 9884 CCATCG^CCGGCCCGTGAAGGGGTTCAACGGCAGCGGTTGCTCC ACCATGTGCTGCGGGGTGTAGAGGGCCTGG ACGTAGAAGCCC AGCCGG 

<MPRGTFPNLPLPQEVMHQPTYLAQVYFGLR 

3 997 6 GCGCTCTCCATC ATGTCGGGTCCGTCGAGCACCGAGACGGGC ATCATGCCCGCCGCGGCCACGCCCCGGACCTGGGACGGCG AACAGGCGAC 

<A SEMMDPGDLVSVPMMGAAAVGRVQSP SCAV 
40068 CTTGACGTCGTGGCCGGCCGCCCGCAGCGCCCAGGCCAGCGGCACCATGCACATGTAGTGCCCGGCCCAGTTGGACACGGTGAACAGAACCT 

<KVDHGAAR LAWAL PVMCMYHGAWNSVT F LV K 
40160 TCATCGCAGCCTCTCTGTGGCCTGCCGAGGGG AGGTTGGGGGTCGCGGCCGGACGGTCAGGAGGTCAGGACCGGCAACTCCCGCGCGGGA 

<M <-STLVPLERAP 

4 02 50 TCGGCGAACTCGATCACGATGGTGCGCCGCCACTGGTCGGAGGGGTTCGGCCCGGAGCCGTGCACCAGCCGTACGTCGTGGACGATGAAGTC 

<DAFEIVITRRWQDSPNPGSGHVLRVDHVIFD 
40342 CCCCTCCTGCGAGGGGACCGGC ACCCGCGGCCCGGCGTCGCGCACGGCCGTGACGTCGGCGTCGTCGGGC AGCAGGTGGG AGCCGGGCACGC 

<GEQSPVPVRPGADRVATVDADDPLLHSGPVG 
404 3 4 CCTCCAGACAGCCGTTCTCCGGCCCGGCGGTGTCCAG^CAGATGCTGATGTTGCAGACCGCGTGCGGGGGGACGTTGA 

< ELCGNEPGATDLC IS I MCVAHPPVNVRDRH 
40526 CACGG^ACGCCGGCCGCCCGCAGTGGCTCCTTGAGCACCAACGCGAAGGCGGTGGGCACGACCGGG^^ 

<W PVGAARLPEKLVLAFATPVVPTGLVDAAVA 
40618 G<X:GATCTCCGGCCG<3TGCAGCAACTCGCCCTGCGGCCAGTCCTGCTTTTCGAG^ 

<AI EPRHLLEGQPWDQKEL.NH I RY LVPEAGEV 
40710 CCTCGTAGTTCCAGTAGTCGGCGTTGGCCCGCGCCGGACCGGCGAACCGGTCGATC 

<EYNWYDANARAPGAFRDILSVAGAKLQALV 
40802 TCGGGGTCGAGCACCGGCCCGACGTGCGCGATCCCGTCGCTGCGGAACCGGCTGGCCACGGCCTCGCGCTCCCGCGACATGGTGGTCATCG 

<E PDLVPGVHAI GDSRFRSAVAERERSMTTM 

< • R 

40893 GAGGCTC ACCCCTTCGGTCGGTCGGCGCGTGCCGGTGCGGCGGCCG ACGGCG ATGATGTCGC AC ACCTCCGGCG ACCGGCGCTCGTGCAGC 
<LSVGETPRRTGTRRGVA I I DCVEPSRREHL 

40984 ACCTCCACCTCG<jCGAAGCCGGCGTTGTGCAGGGCGGCGAACAG<K;ACTCCCGGTCGAGCCAGCGCACGTCGACGCTGAGCCC 

<V EVEAFGANHLAAFLS ERDLWRVDVSLGRAQ 

4 107 6 CGGCTCGXXX5TGCTCCTCGCGX5ACCTGCTTGACCGTGTAGCCGTCGATCGGTTC 

<PEPHEERVQKVTYGDIPQLDGVGGWYHTSLY 

41168 AGATTCCCGCCGCGACGCCCGCGATGTCCTTCAGCAGGX3TCCACGX3CT^ 

< IGAAVGAIDKLLTWPERVHYLLGACLVADF 
41260 TCGCCCAGCTCGGTGAAGTCGATCCGTTCCACGTCGGCGACGC^CAGCTCCACGTTGGTGATGCCGTTG ACCTCC ATCACCAGCTCCGCGCG 

<EGLETFDIREVDAVRLEVNTIGNVEMVLEAR 
41352 GCGCAGGTTCTCCGGACGGCCCTCCAGGGCAAGCACCGTCGTGCCGGG<rrcCCGGGCGAGGGCAAGCGTGTCC 

<RLNEPRGELALVTTGPHRALALTDAGELAGL 
41444 GTTCGAGGATCCGCCGCGCGTCGGGAAACGCACCGAAGAACTTCGCGGCCCG^ 

<ELIRRADPFAGFFKAARDAPSQSLLYGHQS 
41536 CCCTCGGCGTAACGCACTCCGTCGTGCTCGAATCCATTCACCCAC 

<G EAYRVGDHEFGNV 
41627 GATCAAAGCCTAGCGATGCCATTGCGGTTGCGGACTAGTGTTTTCATCATATTCAGCGG 

<• RSATSLRENVLR 
41718 GGCCC ATTCCGGAATCCGGTCCCCGCCG ATCTCGATATCGAAGGACG ACTGG AGTTCCGCCGGCCCGGCGTCGGCCGCCC ACCTGGCGGTGC 

<AWEP IRDGGI E I DFS SQLEAPGADAAWRATR 
41810 GGGCCAGTCCCTCGGGCAG^GGGGTGTCCGTCCAGTCGCCGAAGAe^ 

<ALGEALPTDTWDGFVSRALETATYATRVED 
41902 CGCGACGGCAGGTGGGCGATCGGATGCTCCGGC ACGCCGGCCGCCG ACCX3TACCGCCTGGGCCAGCTCC AGC ACC^TGTTGG-rGCTCG ACGA 

<K SPLHAI PHEPVGAASRVAQALELVTNTSSS 
41994 GCCC ACGTTG AATGCCCGC CCCC ACGCGGCCTCGGTCTCGG^^ 

<GVNFARGWAAET EAARSVVNVVDGVYT FARV 
42086 CCTGGCCGCCGTCGCCGTACACGGTGATCGGCTCGCCTCGCAGGA^ 

<QGGDGYVTI PEGRLI QNFFIAVANRYPDRM 
42178 TTCTGCCACTCGCCGTAGACGTTGTGCATGCGGAAGGCGGTGAAGGGC AGCCCCTGGGTCCGCATCGTCACCTCC AGCTCGCGCTCG ACCAG 

<NQWEGYVNHMRFATFPLGQTRMTV£LEREVL 
42270 GTACTTGGCCAGGCCGTAGCTGTCCGCGGGGACGGGGACGACGGACTCGCG^ 

<YKALGYSDAPVPVVSERMPTEGHGYVAVSSA 
42362 CGAAACAGAAGAACCGCACGCCGGTACGC AGOG ACGCGTTGATC AG ATTT ATGCTGCCCATCAC ATTGGTGCCGT AGTTG AGCTGCTTC ACC 

<FCFFRVGTRLSANILNISGMVNTGYNLQKV 
424 54 G AATGGCTGATCGCCTCCGCCGCG AAGGCGGCAAAGTGGAAGACCCGCTCG AATCGGTTCTCGGCGAACAGTGAATCGACG AAGTCCACGTC 
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<SHSIAEAAFAAFHFVREFRNEAFLSDVFDVD 
42546 GGTCACCGAACCGACGGCCAGGTCCACCCCGGCCGGAACCCGC^ 

<TVSGVALDVGAPVRQRSGGSLDDLVTVRHGN 
42638 TCCTGACCAATGACTCCACCAGGTGCGAGCCGATGAATCCGGCACC ACCAGTCACCAGAC AACGAACC ATCCGGGGCTCCTTCGTC AATA 

<RVLSEVLHSGI FGAGGTVLCRVM 
42728 GATCCGGAAAGGGTTGACCGAGCGGGCGATCTGCGGCGCGCCCTTCGTCGGCGAACACCGACCCCCGAGAGAAAGCTTCGCCTCAGGGC AC 

< * P V 

42819 CGGCGACCGGTCGGCCTGCTTCTTCAGCGGCTCCC ACCAGTCCCGGTGCGTCCGGTACCAGTCGATCGTCTCGGCCAGGCCGTCGGCGAAGG 

<PSRDAQKKLPEWWDRHTRYWDITEALGDAFA 
42911 CGACCTCGGGCCGGTAGCCGAGCGCCCGC AGTTTCGCGTCCGTCAGCGAGTAGCGGCGGTCGTGGCCCTTGCGGTCCGGC ACCCGCTCG ACC 

<VEPRYGLARLKADTLSYRRDHGKRDPVREV 
4 3003 CGGTCCCACCCGGCCCCCAAGGCGTCCAGCAGCCGCCCGGTCAGCTCCATGTTGGACAGCTCAGCCGTGCCGGCGATGTGGTAGACCTCGCC 

<RDWGAGLADLLRGTLEMNSLEATGA I HYVEG 
43095 GGGGACACCGCGGTCGACGACGGTCTGGATGCCCCGGCA 

<PVGRDVVTQIGRCHDTVHIWDRVNGGDGYLP 
43187 GCACCCGTCGCCCGTTCAACAGCTCGGTGACGAACAGCGGGATC AGCTTCTCCGGAAACTGGTACGGCCCGTAGTTGTTGCCGCACCGGGTG 

<VRRGNLLETVFLPI LKEPFQYPGYNNGCRT 
4327 9 AGGCAGACCGGCAGCCCGTGGGTGCGGGCGTAGGCCAGGGCGATCAGGTCCCCGCCGGCCTTCGCCGCCGCGTACGGGGAGTTCGGCGCCAG 

<L CVPLGHTRAYALAILDGGAKAAAY PSNPAL 
43371 GGGGGTGTCCTCGGC CCAGGAACCCTCGTCGATGCTGCCGTAGACCTCGTCGGTGGAGACCTGGACCACCCGGGCGACCCCGGCGTCGAGAC 

<PTDEAWSGEDISGYVEDTSVQVVRAVGADLC 
4 3463 ACGCCTGCATGAGCGTCTGGACGCCCTGCACGTTGGTGCGGACGAACTCCGCCGAGTCGGCGATGGACCGGTCGACGTGCGACTCGGCGGCG 

<AQMLTQVGQVNTRVFEASDAISRDVHSEAA 
4 35 55 AAGTTGACCACCACGTCGTGCCCGGGCAGCACCTCGGCCAGCAGCGCCGTGTCGCAGACGTCGCCCTGGACGAAGGTGATCCGGTCCTGGAC 

<FNVVVDHGPLVEALLATDCVDGQVFTIRDQV 
4 3647 CGGTTCG AGGTTGGCGAGGTTGCCCGCGTACGTC AGCTTGTCC AGC ACCGTC ACCCGGGCCTGCGCCGTGTCGGGGT AGGC ACCGGTGGCC A 

<PELNALNGAYTLKDLVTVRAQATDPYAGTAL 
43739 GGTCGCGGACGTACTGCG AGCCGATG AAACCGGC AC CGCCGGTG AC C AGG ACGCGACGC ATC AG ACC CC C ACCCGG ACTTC GC TGTG AT 

<DRVY'QSG I FGAGGTVLVRRM 

< ♦ VGVRVESHD 
43828 CGCCGAGGACGAACCGGTGCGTCTT^jGGCACCCGGGGCCCGGGGACCACCCGCGCCTCCCGGCCGATCATCGAGAACTCGATGCGGCCGATG 

<GLVFRHTKPVRPGPVVRAERGIMSF£ I RG I 
43920 CCCTCGATGTAGGCACCGCGCAGCACGATGGAGTGCTCGATCTCGGTCTCCAGCAGGGTGCAGTCGCAGTCGATCGAGGTGTACGGGCCGAG 

<GEIYAGRLVISHEI ETELLTCDCDI STYPGL 
44012 GTAGGAGTTGCGGATGATCGAGCCGGCTCCGACCACC ACCGGGCCGACGATCCGGGAGCCGCTG ACGTCGGCGCCGGCGCTGATCACCACCG 

<YSNRI I SGAGVVVPGVIRSGSVDAGASIVVP 
44104 GGCCGATCAGTTCGGTGCGGTCGTCGACCTTGCCCTCGACCAAAGGCTCGACGCTGCCGAGG ACGAACCGGTTCATCTCCAGC ATGTCGGCG 

<GILETRDDVKGEVLPEVSGLVFRNM£LMDA 
44196 AC^TTGCCGGTGTCCTTCCAGTAGCCCGTGATCATGGTGGAGTC 

<LNGTDKWYGTIMTSDVRHGRDIMWQVADTIE 
44288 C AGCTCGTTGCGCCAGGACGGCTTC AGCTCGGCCACGGCATCGTGGACCACCGGGCTG AAGACGT AG ACCCCGACC AGGGC CAGGTCGCTCT 

<LENRWSPKLEAVADHVVPSFVYVGVLALDSK 
443 80 TGGGGTGCTCGGGCTTCTCCTCCACCCCG ATC ACCCGGCCGTCCGCGCCC ATCTCGGCG ACGCCGAAGGCGTGCGGGTCGGCGACCCGGGTC 

< PHEPKEEVG IVRGDAGMEAVG FAHPDAVRT 
44472 AGC ATGATCTGCGCGTGCGGTCGCTCCTGCCGGAAGCGCTCGACGATGTCCTTG^ TCCCGCCGACGATG AAGTTGTCGCCGAGGTACATGAC 

<LMIQAHPREQRFREVIDKIGGVIFNDGLYMV 
44564 GAAGTCGTCGTCGCCGAGGTAGTCGCGGGAGATGAGCACGGCGTGCGCCAGCCCCCGGGGAGCCTCCTGCGGAAGGTAGGTCACCTGGAGGC 

<FDDDGLYDRS I LVAHALGRPAEQPLYTVQLG 
44656 CGAACTGGGAACCATCGCCGACCACGCGCTGAATTTCG^ 

<FQSGDGVVRQIEPATSGVVIGVEEIGGERI 
4474 8 GCCTCGAGCCCGTAGAACAGCACCGGCTTGTTGGCCACGGGAAT^ 

<A ELGYFLVPKNAVPILQKASTHTI PRLRSGV 
44840 CCCTCCCGCCAGGACCAGCGCCTTCACGAACGCCC 

<GGALVLAKV 
44931 TGGACTGGGCTCTTCGTGAACGTACCGAAGGATCACTCGTG 

< • AD 

4 5022 CCGCCATTTCCGCAAACGXXXXjCCTGGCCGC^ 

<AMEAFPPRAPGGGVLELVAALGHVTDPPSA 
4 5114 GGCAGCAGCAGGGTCTGCAGGCCGGCOTACACCGCCCCACCGTCGGCCAGGG 

<P LLXjTQLGAYVAGGDALTDGVMLAREPAVKL 
45206 CTCGTCGCAGGCGGTGCGGAAGATCCGC^ATCGGGCTTGA 

<EDCATRF I RPDPKVAGVEHSFVYADVLEAMG 
45298 CGTACGCCGCGAAGGTGGC^CGCAGGTCCCAC^CGATGTTGCTGACCAC^ 

<YAAFTPRLDWAINSVVATGVGRRRLEALVP 
45390 GO^GTCGCGGTAGGGCAGCCAGCCXSTCCGTGC^^ 

<A ADRYPLWGDTRFLRDYLAEALGPHPLDVTS 
45482 GAGCAGGCCCACGTACGCGCTGCGGTGCGTTTCGGGCGAGAGGTC^ 

<LLGVYASRHTEPSLDRRAYVEALGPPVAHPE 
45574 CttKXTCGCCGGGACGGCCGGCCGCCAGCAGCAACCCGGTC^ 

<PGGPRGAALLLGTLAEQQAPDLQVGVTAAA 
45666 GCCCGCAGCCAACGCTGCX3GCAGTTCCACGGCGAACAGCGTGCCGGAGAAGTC 

<A RLWRQPLEVAFLTGSFDFLVADI PRPLPTT 
45758 CATCGCTCTCCTCGGTGCAGCCGCGTCAGGGCGGC ACC AGCCTGGCAGCGGACCAGCCGGATGTCCATCATGGAGG AATGCGCCGGGTCG 

< M 

45848 ggcgcgcccgccatggccgccggtccggacgacaggcattttcggtca 
45940 ttcagcgatcctg<;aggcatccgtcacccgtacccgaacc^^ 

>VTRTRTALRRLLAAGLAS LATAA 
4 6030 CGCGACCCnK?GTCC^CACCGCCGGCCCGGCGCCG 

>ATLVATAGPAAAATTPGIDVSHYQGSINWT 
4 6122 GCGTCCGCAACGCGGGCATCCAGTTCGCGTTC ATCAAGGCCACCGAGGGTACGAGCTACAAGGACCCCAACTTCAACGCCAACTACGTCAAC 

>SVRNAG IQFAF I KATEGTSYKDPNFNANYVN 
4 6214 TCCTACAACGCCGGAGTGATCCG<XXaGGCGTACCACTTCGCCCGGCCGAACATCrc 
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>SYNAGVIRGAYHFARPNISSGATQANYLASN 
4 6306 CGGCGGCGCCTGGTCGGCGGACAGTCGC ACCCTGCCGGCCGCGCTGGACGTCGAGGCCAACCCGTAC AGCGGCGGCACGTGCTACGGCCTCA 

>GGAWSADSRTLPAALDVEANPYSGGTCYGL 
4 6398 GCACGTCCGGGATGCGTAGCTGGATCCAGGACTTCCTGAACACGTAC 

>STSGMRSWIQDFLNTYKARTGRYAVIYTTTS 
46490 TGGTGGAACCAGTGCACCGGTAGCTGGACCGGGCCGTGGGCCAACCACCCGCTGTGGCTCGCCCGC^^ 

>WWNQCTGSWTGPWANHPLWLARWSSTPGTLP 
4 6582 GGCCGGCGCTTCGGTCTGGAGCTTCTGGCAGTACACGGCCTC 

>AGASVWSFWQYTASGSVSGI SGNVDRNNWN 
4 6674 GCGACCGCACCCGGCTGATCGCGCTGGCGAACAACACCTGACCCGAACGCCCGGGTAGGCGGTTGGCGGCAGCGGAA 

>GDRTRLIALANNT 
4 6765 ACGGTCGGCGGCCC^TCCGGCTGCCGCCAGCCCGTCACCCGGCCCGCGTCGCGGGCCGCGCCGGCATCCGGTCCTGCCGGCCCGGGGTGGCC 
4 6857 CGCCGTGCCATCCGCCAGGCGGCCACGCCTGCGACGGCGCCGGCCACGACACCGAACAGCGCGAACAGCACCGGAAGACCCGCGCTGACCAG 
4 6949 CAGCAGCACCACGTCGCCGAAGGCGACGAGC ATCCACAGTGCCAGCCCGGGTCGAGCGTCCCTGCGTCGGTGTCCC ATGTCGCACCTCCTCG 
47 041 CATCGTCCGGGATCAG ATACCCCGTTCGACGC AAGT AC ATGCGAATCG ACATCGCAACGGCGGGAATCGCCTGCTC AGCGGGCCG AGTCGGC 
47133 AATGGACGGCCCCGGCCGGGCGCTCAGTCGACCCGGTCCGGCTCGAAGC^ 

<• DVRDPKFGKAIRDFDALRAQWD 
47224 CCTTGT/TCGCCACCTCCCACCGXjAGGGCGTACCCCCGGTTGCTGGCCGTGACGAAGCCCCG^TTGCGGACGTGGA 

<KNAVEWRLAYGRNSATVFGRNRVHI RTGDR 
47316 TTCTCC AACCACTCCCAGTCCGCGCACGTCTTGTAGTAGTCGC AGCGCTTGATGCTCAGATACTGGTAGCCGTTGACGTAGTTCTTCCGGGC 

<N ELWEWDACTKYYDCRKI SLYQYGNVYNKRA 
4740 8 CG<3TTCCTTCTCTTTCCAGTCGGCGTAGGCGTCGCCCTCCGG<}GTGCTGGTCCACTGCACCAGCAGCTCGCCC ACCCCGTCGCGCTCGTCGA 

<PEKEKWDAYADGEPTSTWQVLLEGVGDREDF 
47 5 0 0 AGACGATCGTGTTCTGCCCCACGCTGCGCCGCACCCAGCCCTTCGGCAGGGGCAGCGAGAAGCCCGCCGGGTCCTTGTGC AGGAGCC AACCC 

<VITNQGVSRRVWGKPLPLSFGAPDKHLLWG 
4 7 592 TCGGGCAGGGCGTTCGGGTCCACGGACG<JGCTGGCAGACGX^^ 

<EPLANPDVSPSASPTPSPPAASSTAPAASPT 
47684 CGGCGTCGACACGCCCGCCTGCGGGTCGCCGCCGGGCCCCGGGTCGTCGTCACCGCTCCGGCCGAGCAGCGGCACCGCCGCCAGCAGGCCGA 

<PTSVGAQPDGGPGPDDDGSRGLLPVAALLGI 
47776 TCAGCAGCACCGCGACG AGCGCGCCGACC AGC AGGCCGCGGCGGCG ACGCTCGGGCTTCGTGCCCCCGACGACGGTGGCCCGGCCCGTCGAC 

< LLVAVLAGVLLGRRRREPKTGGVVTARGTS 
47 868 GAGAGTGCCGXjKjCCGGAGXSCGXXXTGG^AGCACCGACGT^ 

<S LAPGSAAPLVSTPAAEERPAVAPERAPAPP 
47960 CGAAACGGGCACAGCCGCGCCCGGATCGACCCGGGTGTCGTCGGCTCG<^TGTCGTCGGCCCGACCGGCCGCCGGGGCCCCGTCGGCCCCTT 

<SVPVAAGPDVRTDDARTDDARGAAPAGDAGE 
48052 CGTCCCCAGCCTCGGCGGCCCGGTCCGCCGGCGGCCCGTCACTCCCTCCGGCCGGCGTGCCCGCAGCCTCGTCGGCGCTGGTCGCGGCGTCG 

<DGAEAARDAPPGDSGGAPTGAAEDASTAAD 
4 8144 GGGGCGGG<3ATCTTCGCGGTGGGATCGGCCGCCTGCGGGCCGAAGTCGGTGACCTTCGCGGTCGGATCGGCGTCGGCGCTCGGCAGGTCGAC 

<PAPI KATPDAAQ PGFDTVKATPDADAS PLDV 
48236 C ATCGCGGTCGGCGCGGCGTCCACCCCCGCCGGGACCTTCGCGGTCGCGTCGCCGCCGCCCACGCCGGCCGCCGCCGCGCC ACCGGCGGC AA 

<MATPAADVGAPVKATADGGGVGAAAAGGAAL 
48328 G ACC ACCTTCCGGGGT AC GCGGTGCGGGCGGGGC CGCCGGCCGGC CGGTG CG CTGTTCGGCGGGGCGCGGTGCGGGC ACC AC CGGCGGTCGC 

<GGEPTRPAPPAAPRGTRQEAPRPAPVVPPR 
4 8420 GGCTCGCGCGG^CCGTTCGGCCCCG<^CG<5CGTACGCCGTCGAGCAGCGAGATG<n'CTTGGCGCGACGCCCGGCGGCCCGCCGCAGGAGCCG 

<P ERPGNPGPRRVGDLLSI^KARRGAARRLLR 
48512 CTCGGCCACCTCGGCGTCGATCCGCTCGGCCGGCTCCTTGC^ 

<EAVEAD IREAPDKRLLGNLVPKLPGANRP PP 
48604 GCATCGGCTCGGTGGCC AGCGCCGCCAGG^TGGCGATCGCCGACGGTCGGGCG AACGGCG ACTTGCCCTCCACCGCCGCGTAGAGCGTCGCG 

<MPETALAALTAIASPRAFPSKGEVAAYLTA 
4 8696 CCCAGCGACCAGAGGTCCGCCTCCG^CCGGCG^TGCCGTCCCGCGCCCGCTCCGGGGCGATGTACGCCGGGGAGCCCAGCACCATTCCGGT 

<G LSWLDAEPGATGDRAREPAI YAPSGLVMGT 
48788 CCGCGTCACGTTCGGGTCACCGGGGATGGTCGCC AGGCCGAAATCGGTCAGCACCACCCGGCCGTCGGTGCCGAGCAGCACGTTGCCCGGCT 

<RTVNPDG P I TALGFDTLVVRGDTGLLVNGPK 
4 8880 TGATGTCCCGGTGCATGACGCCGGCCTTGTGCGCCGCCTTCAGC GCGCCGAGCACCCCCAGGCCGATCTCGACCGCCTTGGCCGGCGACACC 

< I DRHMVGAKHAAKLAGLVGLG I EVAKAPSV 
48972 G^CCGTCCTCCGCGAGAGTGTCCTGAAGGGACTTCGACGCC ACGT ACTCCATG ACGATCCACGGGTCGCCGTCGGTGCGC AG AACGTCGAA 

<P GDEALTDQL SKSAVYEMVI WPDGDTRL.VDF 
49064 GATGCGGACCACl3TTGACGTGGTTGAGTCGCGCGATGGCACG<^ 

<IRVVNVHNLRAIARAERLSRERM2RREEPTL 
49156 GX3CTGGGCGX3CGG<3ACCAGTTCCTTGATCGCCACAT^ 

<SPPPVLEKIAVDRHLVEDRAKWVRGM 
49247 GAGCGGCGAAkTCAGCCGGTACCGGTCGGCAACGAGTTGGGGAAGCGCGTTC^ 
4 93 39 ACCGCCG^ACGCCACTGTGCGACGAAGGTCAAGTTCGCG^ 
49431 GGGGCGTTCCGACCGCCGACX3AACTGGCCGCGCTGGTCGGC^ 
49523 TCGX3CCT03GCACG<:AGCG<rrCGGCCCGGC 
49615 TCtX!CAGXlATGTGGCCAGTG?rGX3CCCGCCGCCGAGGCGG<3 
49707 GACGATGCCGTGACGGATAGGGAGGGTCGATGATTCCCGAGGAGGACCGA. 
49799 ATCCGCCAGATGCGCGAGTTCGCCGACAAACTCCAGXSCGGAGG^ 
49891 GG<^CAGATCCCCAACCCCGCCGACGCCTTCGTCGAACTGG^ 
49983 TCTGGTCCX3TCGGTGGGGCCACCGGCCACCIXX;CCACGGC^ 
5007 5 CGCGTCTCCGACGTGGAGCGG^CCCTCGCCAACCCGGGCGCG<X:GGGGCA 
50167 CGXK?GX3G^CGGGK3CXXXjftXnXXrTG^ 

50259 CATGTGGGCGTGCATCCAGCIACCACGAGACCGCCGGGCACTGXI^ 
50351 TGGGCCGCCTCAAGGAGTACCG<3CGCG<SCCTCX3CCGAGGCGTCX^^ 

50443 GACCTGATCGACAAGGTGCAACGCACCCACGACGCGGCTGCCGCGAACTACGACGCCCTCGCGGCCGCC 

5053 5 CACCGAACTCAAGCCGCTACACGACGAGT ACGTGG AGAAACTCCAGCAGAAGCG<X3CGTACGAG<K:GACTACCGCCGATCCGAAGGCGCTGA 

50627 TGGGCAGCCG<;CTGCCGGAC^GCCGGTGACTGACGCCGACCTGGAA 

50719 GAACTCCAACAG^CCAAGCAATGCTCCGCCAACCCCCTCCTCCGCACOSCCCAG^ 

50811 AGGCACGCCTCCGOTCATCCCGCCCGTATTACCAGTCCCAACC^^ 

5090 3 TTCCTGTCCAGAGCGCACCCCAAAC AG^TCCTGTACTCG<XXX3A^ 

50995 ACCGX5CCACCCAGTGACA7X^CAACCCGCCAG<SAG{3TGCTG<3CCCTACCCrrC 

junction marker 
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51067 CAGCACAACCGGCCAGGTAGGAAACCAGCCCCGCTCAA^ 

51179 TGGGGATTAATCAACCCCTTGGCAATACTCCTCCG A 
512-71 ACCGGCGCTGCAGGCACTCGCCCGGGCTCGGGACG 

513 6 3 CGTAAACACCTTAGGTCTCAATGGTGAGCCGTCACGCTCATC 

514 55 CATGGGCAACCAGCGAG<K3CGTTCCTCCAGTAGTCCGCCCGCCCGAGGCGGAGGGACCAATCGACCCAGGGCCAGCTATTGGCTTCGAC CGG 
51547 TGAAAAACGCTCCTCCAGCGGGCGCTACCTTCACCTTCCTAGC^ 

> V R A 

5163 7 GTCCTGAAACCGATGGTGCCTTGCGAATACGGGCTGACCAGTGGCACCTTAACTACCTGAAG^ 

>SPETDGALRI RADQWHLNYLKAAEAQKLS LG 
51729 GAAGGGGTTGTAGTAGCG<;TCCCG<5ATACTG<5CGTTGATCC AC ACCCCGACCTTCAGCGCAATCTAATC AAAGGG ATTGAC ATCATTCCCGG 

>EGVVVAVPDTGVDPHPDLQRNLIKGIDI I PG 
51821 GG<5CAATX^AGATGGCCAGAAAGATCGCAACAGTCACGGCACTAGCATGGCTGGACTAATCGCAGCCCACGGACAGGGCCAGAGCGGCGCCT 

>GNGDGQKDRNSHGTSMAGLIAAHGQGQSGA 
51913 TAGGCATAGCACCCAGAGCCAAGATCATGCCAATCCTCTCTTCCGCGTCGA^^ 

>L GIAPRAKIMPI LSSASNNLGDADGLAAG I E 
52005 TTTGCAATCTCGCATGGGGCGGATGTCATCAATGTCTCCAGCGGAGGCGGCGCCAGCGTTCGACTCATCAAGGCA^TCAGAGAGGCGGTCGC 

> F A I SHGADVINVSSGGGASVRLI K A I R E A V A 
52097 CGCAGACATTGTAGTTGTCGC AGCCGCCGGAAAC AGTCCCGAAGACATGAC AATTGGCTATCCAGCCAGCGAGGAAGGAGTCGTCGC AGTTG 

>ADIVVVAAAGNSPEDMTIGYPASEEGVV A V 
52189 GCGGAATTG ATCGACAGGGAGAGCATGCTTCAGTTT CTGTCGTCGG ACCGG AAGTTGACTTAGTCGC ACCGGC AGTCGAC ATCTACAGC ACC 

>G G IDRQGEHASVSVVGPEVDLVAPAVD I YST 
522 81 AGTTACGACGGGAAGTACTCCAAAGGC ACCGGTACGTCC AGTGCC ACAGCGATAGTCGCAGGGGCCGCTGCTCTGGTCCGATCGAAGTTTCC 

>SYDGKYS KGTGTSSATAIVAGAAALVRSK F P 
52373 CGACCTGCCCGCCTCGGAG<3TCGTCCATCGCCTCACTGCCACCGCCATCGACAAAGGGCCGCCAGGGCACGACG ACCAGTACGGCTACGGCG 

>DLPASEVVHRLTATAIDKGPPGHDDQYGYG 
524 65 TTATCGACCTGGTTGCCGCGCTTACGGCAGACGTACCCCCGGTGGGCTTTGAGTCGGCGACGGCGGACGTGCCCGACGTGCCTGGGTCGACC 

>V IDLVAALTADVPPVGFESATADVPDVPGST 
52557 ACGACGGCGGTCGCCGAGCCGGCAGGCGAGGGTG ACGATG<XSGCAACGGCCCGAGGTCTGGCCACGTTGGGAGTGATCGTGGCTGCTGCGGG 

>TTAVAEPAGEGDDGATARGLATLGVIVAAAG 
5264 9 CGCTTGGGCGCTGGTCGCTCGACGGCGTAGGTTGAGCGACGACCCGCCCCCGCGGATC AGCCGGTGACC AGCC AGCCCTG ACGCCATGTCG 

>AWALVARRRRLSDDPPPRISR 
52740 GCGACATTGG^GGGTCGGGGTGGGTGGGATACCGCTATTTGGCC^ 

52 832 GTCGGTCGACGGCCAGGCGGCGTCGACGGAGGCGGG<^AAGCGGTAGCGTCG^CGACGTGCCGAACTCGATTTCACTCCGACTCGTGCTCGC 

> V P N S I SLRLVLA 
52924 GTCGGCGAGCCCTGCCCGTCGCAAGCTCCTCCACGCCGCCGGCATCGAACCCG ACGTGCTGGTCAGTGGGGTCGACGAGTCCCAGGTG ACC A 

>SAS PARRKLLHAAG I EPDVLVSGVDES QVT 
53016 GCGAGCGAGCCGAGGATCTGTGCCT<X3AGCTGGCCCGCCTGAAGGCGCAGGCGGTC 

>SERAEDL,CLELARLKAQAVVGRLRPSADERT 
53108 CTGGTGCTCGGCTGCGACTCGGTGCTCGCCTTCGACCGCGAG ATTCTCGGC AAGCCGGCCGACG AGGCGG ACGCTACCCGGCGTTGGG AGCG 

>LVLGCDSVLAFDREILGXPADEADATRRWER 
53200 GATGCGGGGGGGCAGCGGGGTGCTACACACCGGGCACTGCCTGATCGACGTCATCCACGAGACGCGCGCCG AGGCGGTCGCCTCGACCACCG 

>MRGRSGVLHTGHCLI D V IHETRAEAVASTT 
53292 TGCGTTTCGCTGACATCAGCGACGAGGAGATTCCCGCGT^ 

>V RFADISDEEIAAYVATGEPLAVAGAFTI DG 
5 33 84 ATGGGCGGGGCGTTCCTGGAGGGTGTCGACGGCG ACCCGGGCACGGTGGTCGGCCTCTCCCTACCGTTGCTGCGCCGGCTTCTCGGCGAGCT 

>MGGAFLEGVDGDPGTVVGLSLPLLRRLLGEL 
5347 6 GGACCTGCGGATCATCGACCTGTGGACGAAGGTCGCGCCGGGCGGCCAGGCGGTCGAGGCGGTGGGT ACGGTCC AGCC ATG ACG ACG AAGT 

> M T T K 

> D L R I IDLWTKVAPGGQAVEAVGTVQP • 
53567 CCCTGCCGCTGACCCCGGAACTGCATGCGTACGTGGTGGCCCACGG ATCGGACCCGGACGAGGTGATGCGGGATCTGATCG AGGAGACCCTC 
>S LPLTPELHAYVVAHGSDPDEVMRDLI EETL 

53 659 GCCGCGCTGCCCGCCGAC^CGAGGATGCAGGTG^CCCCGXJAGCAAGCCGCCrre^ 

>AALPAEARMQVAPEQAAFLTFLTRLIGARRA 
53751 G<nXMAGGTGG<X:ACCTTCACCGGCCTCT<OT 

>VEVGTFTGLSSLAIARGLA£GGRLTCFDI S 
53843 AGGAGTACACGGGCGTCGCGCGGCGX3TACTGGX3CGC^ 

>E EYTGVARRYWARAGVADQ I DLR IG PAGDTL 
53935 CGCGGX3TTGCCGTACGAACGK3CACCTGGACTTCGCGTTCATC 

>RGLPYERHLDFAFIDADKVGYPVYWAELVPR 
54027 CATGCTCCCGGGCGGGXjTX^ATCGCGGTGXjACAACACGTTGCGCGGGGGCCGGGTGCT^ 

>MLPGGVIAVDNTLRGGRVLAPRDADDRAI A 
54119 CGTTCAACGACGAG<5TGATGGCCGACGTCCGGGT^ 

>A FNDEVMADVRVEPVLL.P TADGLTLARVR • 
54210 GGCGCCAGCCGGACGATCGTGCCAG<nx:GCCGGCCCGGCGTCGTGTGT^ 
54302 AGGAAGGGGCCCITCCTATACCGAATGCGTTAACAAGGC^ 

<• RVSRAFQRA 

54393 GCCCAGGCGACGCOGACGGCGGC AAGCACCGCGAT<IATGG^ 

<a wavgvaal.vai 1tlgqwvkdngldgaflar 
544 85 ggtgccgtccacggcccaggagaacg<k;ttccactcggcgatgcgctggagccagccgggggcgaaggtgagc 

<TGDVAWSFPNWEAIROLWGPAFTLPLLIGSL 
54577 GCAGCAGCACCG<3CTGG<^GACGGTGrrCATCACCGXXKK:GAGCGCGTCCTCAC^ 

<LLVPQAVTNMVPALADESKVKLAVGYSVAS 

54 669 GTCATCAGCGCGATCAGGGCGAGC ATC AGGTACGCC AGC AGC AGGTCGCCGATG AAC ACGCGC AGCTCGAAC AGGAGCGCGAGC AGGGTGAT 

<TMLAILALMLYALLLDGIFVRLEFLLALLT1 
547 61 GATG ACGGCCTGGGCGAGCAGCGACACGACGTCGCGC AGGGCCa^CCGAGCAG<^GCGCGAGCCG<XrKS ACCGGGGTG ACCCGGG ACCC^ 

■cIVAQALLSVVDRLARGLLLALRSVPTVRSRE 
54853 CGATGACGCCGGCGCGCAGCTCGGCGATCAGGCOSAAGCCCTGGAAGAGGCCGCCG 

<IVGARLEAILGFGQFLGGFIALLVLLGPVF 
54945 ATCTTGTACGCCTCG^CTCGGTCGGCGCG 

<IKYAEAQTPANLAPKLLPAFLLLYMVPQFVG 
55037 GACGAAGACCCAGACCGGATTGCGG AGC AC^GTTGCATCTGGCGCTGGGCGACGAGCC AGGTGTCGCGGGCG AACTTCATGATCGG ACT 

<VFVWVPNRLLLQMQRQAVLWTDRAFKM 
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55127 CCGGGTGGTCAGGACTCGCGCAGCGAGCGGCCGGTCTTG^ 

<. SERLSRGTKTLFVDDLSPRHLEI SSLR 
55218 TG AGGCCGG ACTGGTCGAGCC GGCGCAGG ACCTGC GGG ATGGCGGTGGCC C CCTCG TCGACGGTC AGGCGC AGG C C G CC G CC GTC G AC G GTT 

<LGSQDLRRLVQPIATAGEDVTLRLGGGDVT 
55310 TCCAGCTTGGTGACGTACGGCTCGGTGTCGAGCAGTTGGGCGGCCTGCG« 

<E LKTVYPETDLLQAAQPTAAADLGVLLVEGS 
55402 GATCTCCCGCTTCAGCCCGCCCGGCGTACCCTCGGCGACCACCTCGCCGTGGTCCATGATCGCGATCCGGTCGCAG AGCGCGTCGGCCTCGT 

<I ERKLGGPTGEAVVEGHDMIAIRDC LADAED 
55494 CCAGGTAGTGCGTGGTGATGAAGACGGTCATCCCCTCGGCGCGCAGCCGACGGATCTCGTCCCACATGTGGGCGCGACTC 

< LYHTTI FVTMGEARLRR I EDWMHARSQPDL 
55586 CCGCTGGTCGGCTCGTCCAGGAAGACAATGCGGGGGTCGTGG ATG ATGCCG AGAGCGATCTCGACGCGGCGGCGCTGGCCGC CGG AGTAGGT 

<G STPEDLFVI RPDH I I GLAI EVRRRQ.GGSYT 
55678 CTTGCACTTACGGTCGGCGTACTCGGTGAGCTGGAAGGCGGCCAGTGCGCGCT 

<KCKRDAYETLQFAALAREARRLADAKG I GYM 
55770 TCCGGGCGTGC AGG AC C AGTTCCTCGCGGGCGGTGG AGTCGTC CC AGGTG CTGC CGCCCTGGGCGAC AT AG C C G ATC CGG CGACGCACC TC G 

<RAHLVLEERATSDDWTSGGQAVYG I RRRVE 
558 62 GCCGG<3TTCCGCAGCAGGTCGGCCCCGGCGATGGTGGCCTGGCCGCCGTCGGGGGTGATGAGGGTGGCCAGCAT^ 

<APNRLLDAGAITAQGGDPTILTALMRLTTTK 
55954 CCCGGCGCCGTTGGGGCCGAGGAACCCG AAGATCTCCCCCTCGGCG ACGTCCAGGTCGACGCCGCGCACGGCGTCGACCGTCTTGTGCTGTC 

<GAGNPGLFGF I EGEAVDLDVGRVADVTKHQR 
56046 GACCGGCGCGGGAGCGAAACGACTTCCGCAGCCCTCTGGTCTGGATCATCTTCGCTCCTGGTCGTCCTTAGCCGGACCGGGCCGGCCCTC 

<GARSRFSKRLGRTQ I M 
56136 TCTCCGGGACGCC ACGCCACACGGGTGGCCCCGAACGTCGCGCCG AGGCT AACGCGATATAACTTCTCTAGTC AACTTTG ATTAATGGCGA 

< - RS IVERTLKS * HR 
56227 CCGTCGGCCCCCTCCCCCACGTTCCAGCCGTCCTGACTGGCCAACCCT^ 

<G DAGEGVNWGDQSALGEPLY PVGAE I RDAVR 
56319 CTCACACCAGGCCACCTCGACCTCTCCCCGGGCAATCCACAGCTCGTACATCCAGCTCACGCCGACCGGCTTGGAGTCGCGG ATCC AGG AGG 

<ECWAVEVEGRAIWLEYMWSVGVPKSDRIWSS 
5 6411 ACTTCCATCGAGGCACGCATGGTTT^GACACTGGCCCCXTAGCACCTGCCCCCGACT^ 

< EMSARMTEVSARLVQGRSRLAAVAEPR PLA 

5 6503 GGCAGGAACGCGAACGCCGCCACGAACGGATCGCTCGTCTGATGATTGCCCCACCACAGGCCGCGCAGCAGCGTCTCGAACTCGTCGACCCC 
<PLFAFAAVFPDSTQHNGWWLGRLLTEFEDVG 

56 5 9 5 CTTCGGGGTGATCTCGTACGTCGTCCGCGCCCGCCGGGCGCCGACCTGCTCGGTGGCGACCTCGCGGAGCAGCCCCTCCTCGCCGAGCTTGC 

<KPTI EYTTRARRAGVOETAVERLLG EEGLKR 
56687 GCAGCGCGTGGTAGATCGAGCCGGGCTGCACGTTGGCCCACTTGTCGGCACCCCAACTGAGCAGCTCGCGGCGGACGTCGTAGCCGTGCACC 

<LAHY I SGPQVNAWKDAGWSLLERRVDYGHV 
56779 GGCTGCATCCACTTGACCAGGCCGAGAATCATCATGCGAGTGGCAGACACCGGAAAAGCGTATTAGACAAGTTTGACTA 

<P QMWKVL.GLIMM 
56870 GGCAGTGCCTCATCCC ACACTGAGCGATCGTT AGGGCC ACG ACGCCGGCCGAT AAACTCCCCGTCAGTAACATCCCGGG AGGAGCCACG AG 
56961 GTGCGCAAGGTACTX)ATCGCCAACCGAGGCGAGATCGCCGTCCGCGTCATCCGCGCCTGCCGCGACGCCC<5CCTGGGCAGCGTCGCCGTCT 

>VRKVL IANRGE I AVRV I RACRDAGLGSVAV 
57052 ACGCGGACTCCGACCGGGACGCCCTGCACGCGACCCTGGCCGACG AGGCGTACGCCCTGGGCGGCGACACCGCCGCCGAG ACGTACCTGCGG 

>Y ADSDRDALHATLADEAY ALGGDTAAETY LR 
57144 ATCGACAAGCTGATCGCCGTCGCGGCACAGGCCGGGGCCGACGCCGTCCACCCCGGGTACGGCTTCCTCGCCGAGAACGCCGACTTCGCCCA 

>I DKL IAVAAQAGADAVH PC YG FLAENADFAQ 
57236 GGCCGTCCTCGACGCCGGGCTTACCTCGATCGGCCCGACCCCACAGGCGATCCGCGACCTGGGCGACAAGGTCACCGCCCGGCACATCGCCC 

>AVLDAGLTWIC-PTPQAI RDLGDKVTARH I A 

57 328 AGCGGGCCGGCGCGCCCCTGGTTCCCGGTACCTCGGACCCGGTCGGCAGCCCGGACG AGGTG ATCGCATTCGCGGTCGACCACGGCCTGCCG 

>QRAGAPLVPGTSDPVGSPDEVIAFAVDHGLP 
57420 GTCGCCATCAAGGCCGCCTTCGGCGGCGGCGGGCGCGGCCTC AAGGTGGCCCGC ACGATGGAGGAGATCCCGCACCTGTTCGAGTCGGCCAC 

>VAIKAAFGGGGRGLKVARTMEEI PHLFESAT 
57512 CCGGGAGGCGGTCGKTGGCGTTCC^CCC^GGCGAGTCTTTCGT^ 

>REAVAAFGRGECFVERYLDQPRHVEAQVLA 
57604 ACCAGCACGGCAACGTGATCGTCGTCGGCACCCGGGACTGCTCG<TTGCAACG 

>DQHGNVIVVGTRDCSLQRRHQKLVEEAPAPF 
57696 CTCACCGACGCCCAGCGCCGGCAGATCCACGACAGCGCCAAGGCAATCTGCCGGGAGGCCGGCTACCAC 

>LTDAQRRQIHDSAKAICREAGYHGAGTVEYL 
57788 GGTGGGCACGGACGGGACGATCTXTCTTCCTTGAC^TCAACACCCGCCTG^ 

>VGTDGTI SFLEVNTRLQVEHPVTEETAGID 
57880 TCGTCCGCGAGC AGTTCCGGATCGCCG ACGGCGAG AAGCTGCGGCTGGCCGAGG ATCCGACCCCGCGCGGGCACTCC ATCG AGTTCCGG ATC 

>L VREQFRIADGEKLRLAEDPTPRGHSI EFRI 
57972 AACGGCGAGGATCCGGGCCGCAACTTCCTGCCCGCCCCCGGCACCGTX:ACCGCGCTGCGGCTGCC 

>NGEDPGRNFLPAPGTVTALRLPTGPGVRVDT 
58064 CGGCATCTCCGCCGGCGACGTGATCGGCGGCAACTTC^ 

>GISAGDVIGGNFDSLLAKVI ITGETRTEAL 
58156 AGCGGGCCCGGCGGGCGCTGGACGAGATGGTCGTCGAGGGAATGGCCA^ 

>E RARRALDEMVVEGMATALPFHRLVVRDPAF 
58248 ACCGCCGCGCCGTTCACCGTGCACACCCGGTGGAT^^ 

>TAAPFTVHTRW I ETEFDNTVLPFTAAAG PA E 
58340 GGGCCCGGCCGAGCGGGAGACCGTCGTGGTCGAGGTGGGCGGC^ 

>GPAERETVVVEVGGKRLEVTLPAGLGAGTA 
584 32 CCGGGCCCGCCGCGCGGAAGCCGGCCCGGCGGGGCGGCGGGGCCAAGGCCGGCGCGGCGGTCGGCGGCGACGCCCTC ACCTCTCCGATGCAG 

>A GPAARKPARRGGGAKAGAAVGGDALTSPMQ 
58524 GGC ACG ATCGTG AAG ATCGCCGTC GCGG ACGGGG A C AC C G TCGC C AAGGGCG AC CTGGTC GTCGTGCTGG AGG CG ATG AAG ATGG AGC AGCC 

>GT IVK I AVADGDTVAKGDLVVVLEAMKMEQ P 
58616 GCTGCACGCGCACAAGGCGGGCACGGTCC^CGGGCTGTCCGCC^^ 

>LHAHKAGTVGGLSAEVGAVLAAGAPICTIT 
587 08 GAGGTGCAAGGAGGGGCCCCCTCTTAACGCATTCGGTATAGGAAGGGCCCCTTCCTAACCACGCGCCCGG 

> • 

58800 T ACGCGT ACCGGCCG<3GGGTGTT/I^CCGCG ACC ACCGCGAGCGGTG AGGAC CGG AGGTGCGG TTC CT AC ATGGC 

> > V R F L H G 

58891 GCGGTTCCCGCGCACGACCTGACCTACAACGACGTCTTC 
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>AVPAHDLTYNDVFMAFNRSEVGSRLDVDLAT 
58983 CTCCGACGGCACGGGCACCACCATCCCGCTGGTGGT 

> SDGTGTT I PLVVANMTAVAGRRMAETVARR 
59075 GCGCACTCGCGGTGATCCCGCAGGACATCCCGATCGAGGTGGTGGCCAACGTGGTCGCCTGGGTCAAGCAACGGCACCTGGTGCACGACACG 

>G A L A V I PQDI PI EVVANVVAWVKQRHLVHDT 
59167 GCG ATC ACGCTCG<X:CCCACCGACACCGTCGGCGATGCCATCCATCTG 

>AITLGPTDTVGDAIHLLPKRSHGAVVVVDEA 
5925 9 CGGTCGGCCGCTGGGCGTGGTGACGGAGGCGGACACCGTCGGGGTGGACCGCTTCGCCCAGCTCCGCCACGTGATGTC^ 

>GRPLGVVTEADTVGVDRFAQLRHVMSTELH 
5 93 51 CGGTGCCGGCGGACGCGGACCCGCGTACCGGATTCGACCGGCTCTCGGCGGGCCGGCGGCGGCTCGCGCCGGTGGTGGACGGCGACGGCCGG 

>T VPADADPRTGFDRLSAGRRRLAPVVDGDGR 
5944 3 CTCGTCGGGGTGTTGACCCGCAAGGGCGCGCTGCGCGCGACCCTCTACACCCCGGCGGTGGACGACCGGGGCCGGCTGCGG ATCGCGGCGGC 

>LVGVLTRKGALRATLYTPAVDDRGRLR I AAA 
5 953 5 CGTCGGCATCAACGGCGACGTCACCGGCAAGGCCGCCGCGCTGCTGGAGGCCGGGGTCGACGCCCTGGTGGTGG AC ACCGCGC ACGGCC ACC 

>VGINGDVTGKAAALLEAGVDALVVDTAHGH 
59627 AGGCGCGGATGGTCGCCGCGCTGCGGGCGGTGCGCAAGCTTCACCCGGGCGTTCCGGTCGCGGCCGGCAACGTGGTC ACCGCCG ATGGGGTA 

>Q ARMVAALRAVRKLH PGVPVAAGNVVTADGV 
59719 CGCGACCTCGTCGAGGCCGGCGCCGACATCGTGAAGGTGGGCGTCGGTCCGGGCGCGATGTGCACCACCCGGATGATGACCGGGGTGGGGCG 

>RDLVEAGADIVKVGVGPGAMCTTRMMTGVGR 
59 811 TCCGCAGTTCTCCGCGGTGCTGGACTGCGCGGCGGCGGCGCGGGACCTCGGCCGGCACGTCTGGGCCGACGGCGGGGTACGGCACCCGCGCG 

>PQFSAVLDCAAAARDLGRHVWADGGVRHPR 
5 99 0 3 ACGTGGCGCTGGCCCTCGCCGCCGGCGCGTCGAACGTGATGATCGGTTCCTGGTTCGCCGGCACGTACGAGTCCCCGGGTGACCTGTACACG 

>DVALALAAGASNVM I GSWFAGTY ES PGDLYT 
59 995 GACGCGGACGGCCGGAGGTACAAGGAGAGCTTCGGGATGGCCTCGTCGCGGGCGGTCAGCGCGCGTACGGCCGAGGACAGCGCGTTCGACCG 

>DADGRRYKESFGMASSRAVSARTAEDSAFDR 
60087 GGCCCGCAAGGGGATCTTCGAGGAGGGCATCTCCTCGGCCCGGATGTACCTCGACCCGGATCGCCCGGGCGTCGAGGACCTGATCGACGAGA 

>ARKGIFEEGISSARMYLDPDRPGVEDL IDE 
6017 9 TCATCTCCGGGGTACGCAGCGCGTGCACGTACGCGGGCGCGCGCAGCCTGGCGGAGTTCGCGGAGCGGGCGCTGGTCGGGGTGCAGAGCACG 

>I I SGVRSACTYAGARS LAEFAERALVGVQST 
60271 GCCGGCTACACCGAGGGGATGCCCCTACCGACGAGTTGGTGACCCCGCGCCCGGCGGTGAGAAGGGTTCCCCTCTCTACCGGAGGCGTCAA 

>AGYTEGMPLPTSW* 

603 62 C AAGGGGCCCTTCCTTCGTGCGCGGCTGGGTATCGGCGTGACCGACTGCCGCACGCGCCGCCGC ACACTGAGCCGCCGCCCGTCGAGGGCCC 

604 54 ACCGAACGGGCGCCGGGGTCAGTCGAAGAGGCGACGGATGACGGTCCGGGCGGCGGCCTCCGGGTCCGGGCCGGTGCCGGGCGGGAGCGCC 

< . DFLRRIVTRAAAEPDPGTGPPLA 
60545 CCGGCC AGCCAG AGTGTCACGAAGCCGTGCACG ATCGACCAGGCGGCC AGGGCGTCCGCCTCCTGGTCCGGGTCGGTTTCCCGGCGCGGG AG 

<G ALWLTVFGHVI SWAALADAEQDPDTERRPL 
60637 GGCGGCCACCCCGGCCCGCAGCGCCGCGCCGGCCCGATCACGGGCGGCTGTCACCTCGGGGTCGTCGCGACGGTAGAGCTCCGGGCGGAACA 

<AAVGARLAAGARDRAATVEPDDRRYLEPRFM 
60729 TCACCTCGAAGTGGGCCCGGTGGTCGACCGCGAACCGG ACGTACGCCACGCCGGCGTCGAGCAGGTCGCCGGCCTCGC AC AGCGCC CCGGCC 

<VEFHARHDVAFRVYAVGADLLDGAECLAGA 
60821 AGCAGGTCGAATCCCTCGACGGCGAGCGCGGTGAGCAGCCCCGCCTTGTCGCCGAAGTGGTGCGCGGGGGCGGCGTGCGAGACCCCGGCCCG 

<LLDFGEVALATLLGAKDGFHHAPAAHSVGAR 
60913 GCGGGCCAGGTCGCGCAGGCTCAGGGCGGCCGGCCCGGCGTCGGTGATCGCGTCGACGGCGGCGGCGAGCAGGGCGCGACGCAGGTCACCGT 

<RALDRL SLAAPGADT I ADVAAALLARRLDGH 
61005 GATGGTAGCCACGCGGTCCGGTCATGCCGGCAGCCTAACTTGTCATTGACAAGATAGCCAGGCCGAAGCAATCTAGGCAATGACAAGTTG 

<HYGRPGTM 
61095 CCTTCGACCGAGGAGAACCCCGATGGCGCCCCTGATCGCTCTCATCGCCGGCTCGGCCCTGGCCCGACTCGCCGGGCTACTC AACGTCGACG 
61187 CCCTGGCCGGCTGGC ACCCCGCCCTGCGGG1 CGGGCTCGCGGCCATGTTCGCGCTC ACCGGG ATCGCCC ACTTCACCTCCCG ACGGCCCGAC 
61279 CTGGTCGCCATGGTGCCGCCCCGACTGCCCCACCCGGGGCTGCTGGTGACCGTCACCGGCCTGCTGGAGTTGGCCGGCGC 

>MVPPRLPH PGLLVTVTGLLELAGAVALL 
61371 CGTCCCCGGC ACGGCGCGGTGGGC AGCGGCCGGGCTGGGGCTGCTGCTGCTCGCGATGTTCCCGGCC AACGCCTCGGCCGCCCGGCGCGGGC 

>V PGTARWAAAGLGLLLLAMF PANASAARRG 

614 63 TGACCCTGGCCGGCCGGCCGGTGACCCCGCTCGTCCCCCGCGCGCTGCTCCAGGTGATCTTCCTC ACCGCCGCCGCCGCGATTTCGTTTGGG 

>LTLAGRPVTPLVPRALLQV I FLTAAAA I SFG 

615 55 CCCTGACTATCAGGGAGCTAACATGACCCGCATGGAGCTGATAGGGCT^ 

> p . >VPLGRLLVTAGH 
61645 CGTCGTCGGCCAACGGTGGAACCGCTACCTCGCCGAGGAGCACGGCCTCACCCAGGCGG^ 

>VVGQRWNRYLAEEHGLTQAGMVTLMTLARH 
61737 GCGAGCTGCCGC ACCG<K3CGGTCGCCGAGGCGTGCTTCATCCGC CCGGCC AC CCT AACCGGCATCGTCGACAC ACTGGAGCGCGACGGCCTC 

>G ELPHRAVAEACF I RPATLTG IVDTLERDGL 
61829 GTCGAGCGGCAACGCGACGACGTCGACCGGCGCAGCGTGCGGCTCGTCCTGACCCCCGCCGGTCGGGA^ 

>VERQRDDVDRRSVRLVLTPAGRERVAALTNV 
61921 CATGCAGTCCGGACGACCGATGACCTCGGTCGACGCCXJACCCGGCGAAGGCCGCCGTGATCCGGCAGTTCCTGC 

>MQSGRPMTSVDADPAKAAVIRQFLLEV IGS 
62013 GAGAGGAACCTCGGGTGACGGCCCTCGACGCGAGGCCXX^GGCTC 

>G EEPRVTALDARPEAPAC 
62105 GACCGCTGGCGGCGGTGATGGCGTTGCAGTTCGTCGGCACGATGGCCTC^ 

>MALQFVGTMASLYLPSLNADI IDQG 
6219 6 TGTGGCCD3GGGCGACACCGGCTACATCATGCGTACGGGCGGCTGG 

>VARGDTGYIMRTGGWMLLVSLVQIACSTAA 

622 88 tcttcctcggcgcgcgctcc<x:gatgggcttcggccggg^ 

>VFLGARSAMGFGRDVRAEVFAHVNRFSAREV 

62 3 80 acccgcttcggcgcaccctosctgatcacccgcaacaccaacga 

>TRFGAPSLITRNTNDVQQVQMLVLMSCTMLV 
62472 CGCCGCGCCGATCATGAGCGTCGGCGGGGTGTTCATGGCACTGCGGGAGGACGTCGG^ 

>AAPIMSVGGVFMALREDVGLSWLMLVSVPA 

62564 TGGCGATCGCCCTGATGCTGATCATCCGGCGGATGGTGCCCGG^ 

>L AIALMLI IRRMVPGFRL.MQTRIDAVNRVLR 
62656 GAGCAGATCACCGG^TCCGOTTGGTCCGG^GTTCGTCCGCGAGCCG 

>EQITGIRVVRAFVREPYETARFGRANADLTA 

62748 GACCGCCCrTGCGCACCGGTCGGTTGATGGCCCTGATCTTCCCC^ 

>TALRTGRLMALIFPVVTLVLNVSSVAVLWF 
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62840 GCGCGGACCGCGTCGACGCCGGCCAGATCCAGGTCCXKTGCGCTCACCGCCTTCCTGC 

>ranRVDAGOIQVGALTAFLQYLMQILMAVML 
62932 GCCACCTTCATCCTGATGATGGTCCCGCGCGCCGCGGTCTGCGCCGAGCGGATCGTCGAGGTGCTCGACACCGACTC^ 

>ATFILMMVPRAAVCAERIVEVLDTDS 1 v 1 f V 

6 3024 GGCCGCGCCGACGGCCGAGGTCACCGGCCGGGGCGAACTGGAA^ 

>AAPTAEVTGRGELELRGVRFQYPGASAPVL 

63116 ACGACATCTCGrrCCGGGCCACGCCCGGGCGCACCA^ 

>wnTSFRATPGRTTAI IGSTGAGKTTLLTL.I P 

63208 CGGCTGATCGACGCCACCGCCGGGGCGGTG^ 

>rt TDATAGAVLVDGVDVRDLAPDDLWRRIGL 

63300 GGTCTCGCAGCGGCOGTACCTGT^^ 

>VPORPYLFSGTIASNLRYGNPDATDAELWA 

63392 CCCTGGAGATCGCCCAGGCGCGCGACTTCGTCGCCGA^ 

>ALEIAQARDFVAELPEGLNAPITQGGTNISG 
6 34 84 GGGCAGCGCCAGCGCCTCGCGATCGCCCGGGCCCTGGTCCGCAAGCCGGAGATCTACCTGTTCGACGACTCGTTCTCGGCGCTCGACCTGGG 

>GORORLAlARALVRKPEIYLFDDSFSALDLG 
63576 CACCGACGCCCGGCTGCGCGCGGCGCTACGACCGGTCACCGCGGACGCGACGGTGCTGATCGTGGCCCAGCGGGTCTCCACGATCGTCGACG 

>TDARLRAALRPVTADATVLIVAQRVSTIVD 
63668 CCGACCAGATCATCGTGCTTGAGGACGGGGGCATCGTCGGGATGGGCCGACACGCCGAACTACTGGAAGACTGCCCGACGTACGCGGAGATC 

>ADOI IVLEDGG IVGMGRHAELLEDC PTYAE 
63760 GTCGCCTCCCAGCAGACGGCG^X^GTGCCGGC 

6 3 8 51 > GCGGCTGCCCTCCGGCAACCAGG^CAGCGGCC^ 

6 3941 CCGCCGGCTGCTGCGCCGGCTGCGACCGCACCGCCTCCAGCTGGCCGCCATCGTCCTGCTCTCGCTGGTCAGCGTCGGTTGC AACGTGTACG 

>RRLLRRLRPHRLQLAAIVLLSLVSVGCNVY 
64033 c^CGAAGGTGCTCGGCCACGCCACCGACCTGATCTTCAGCGGGGTGATCGGCCGGCAGTO 

>GPKVLGHATDLI FSGV IGRQLPAGTTAEQAV 
6412 5 GCGGCGGCCCGCGCGGCCGGTAACGACAGCTTCGCCGACATGCTGGCCCGGATGGACGTGGTGCCCGGGGTGGGCATCGACTTCACCGCCCT 

>AAARAAGNDSFADMLARMDVVPGVG I DFTAL 
64217 GGGCCGGGTGCTGCTGTTCGTGCTCGCCCTCTACCTGGCGGCC AGCGTGCTGTTGTGGTGGC AGGGCTGGCTGCTCAACGGGGTGGTGC AGC 

>GRVLLFVLALYLAASVLLWWQGWLLNGVVQ 
64309 GCACGGTGCTGCGGCTGCGCGCCGACGTGGAGGACAAGCTG AACCGGCTGCCGCTGCCGT ACTTCGACCGGCAGCCCCGGGGCGAGTTGCTC 

>"RTVLRLRADVEDKLNRLPLPYFDRQPRGELU 
64401 AGCCGGGTCACCAACGACATCGACAACATCTCGCAGAGCCTCCAGCAGACGCTGAGCCAGCTGCTCACCTCGCTGCTCA 

■>c;rVTND I DNI SQS LQQTLSQLLTSLLTVVGV 
64493 aCTGGCCATGATGTTCTGGATCTCGCCGCTGTTGGCGCTGGTGTCCCTGG^ 

^ T AMMFWISPLLALVSLVAVPMSVVVTSLVA 

64585 AGCGGTCACAGCA^TTCATCGCCCAGTGGACGCATACCGGAGAGCTGAACGGCCAGATCGAGGAGGCGTTCACCGG 

pcnORFIAOWTHTGELNGQIEEAFTGHh-L-v 

64677 Lggtc^ggccggcagcgcgagg^aggccgccttcaccgccaagaacgaggagctgttccgggccag^ 

vvFTROREVEAAFTAKNEELFRASFGAQi- 1 i> 
647 6 9 CGGGATCATCATGCCGGCGATGATGTTCATCGGGAACCTCAGCTACGT 

•> r 1 IMPAMMF IGNLSYVAIAVVGGLRVASGS? 
64861 TGAGCATCGGCGACGTGCAGGCATTCATCCAGTACTCCCTCCAGTTCACCCAGCCGCTGACCCGGGTCGCCTCGATCGCCAAC 

>m cTGDVOAFIQYSLQFTQPLTRVASMANLLry 

64953 TCCGGGGTGGCCTCCGCCGAGCGGGTO^ 

>«qGVASAERVF^AVLDAEEQSPDPAVPARVADg 

65045 GCGCGGTCGCGTCGAATTCGACCACGTCTCA^CCGGTACGAGCCGGACAAGCCGCTGATCACCGACCTGTCGCTGGTCGCCGA^ 

>RGRVEFDHVSFRYEPDKPLITDLSLVAEPG 

65137 ACACGGTTGCCATCGTCGGGCCGACCGGCGCCGGCAAGAC^ 

>HTVAIVGPTGAGKTTLVNLVMRFYELDAGRI 

6 5229 ACCCTCGACGGGGTCGACATCACCACGCTGAGCCGC^^ 

>TLDGVD ITTLSRDDLRGR I GMVLQDTWL- F G G 
65321 C ACG ATCCGCG AC AAC ATCGC GT ACGG C CGGC CGG ACGC G AGCG AGG AGG AG ATCGTCGC CGCCXXTCCGGGCG ACG TTCG TGGAC CGGTTC G 

>TIRDNIAYGRPDASEEEIVAAARATFVDRF 

65413 TGCGTAGCCTCCCCGACGGCTACGACACCGTCATCGACTCCGAGGG 

>VRSLPDGYDTVIDSEGSNVSAGEKQLITIAR 

65505 GCGTTCCTGGCCGAGCCGTCGCTGCTGATCCTCGACG^ 

>AFLAEPSLL.ILDEATSSVDTRTEVLLQRA MA 

65597 GGCGCTGCGCTCGGACCCXSACC^^ 

>ALRSDRTSFVIAHRLSTIRDADLILMMEHG 

65689 GCATCGTCGAGCAGC^CACCCACGAC^ 

>RIVEQGTHEQLLAARGAYHRLYQAQFTQP U V 

65781 GCCGCCGTCGGGGACCCCGAGCCCCA^ 

65872 > C3GGCAGC^CCGC^CCCCgWcGCGG^^ 

659 64 CATCGGCGGGAACATGTCGTCGGCGAAGTGGC^ 

66056 TGGTCCGTACCGCGCTGCGCATGGOGAGACGGTAGGGCGGCC 

66?48 C^TCCGGTTACCGCGAAGGGCGACAGGCCGCCAGTCGCTACAGTCGGCAACCCGGCCGATGCAC^ 

66240 C^ACCTGCCGGCGGCCCATGGCC^ 

66332 GATCCGCCGACCGCCTCCACCCGTACGCCGAGGra^ 

66424 GACGACCCGGTTCGTGGCGGGTGCCGGCGAGGTGGAGCCGCCCAGTTACTCCAGCTCGTGGAGCATGAGC 

- - ELEHLMLQRAAET I 



66515 GAGCCCGACAGGCTCGGGTAGATGGTGATGGTCTGGGCCAACTC 

<SGSLSPYITITQALENVTLNNEVAMTIPLIL 

66607 CTCGCTGGCCTTCGGTGCCACCACCACAC^ 

<ESAK PAVVVGG I VQGSAPRCF LKVFGDALDD 
66699 CCATCTTCGCCCGGGCGTTGCCCGACAGCGGCAGC ATCACCTGGCGGGCCGGGGTCTTGCC^CGTCC ACCTCGTCCTGGGAGACGCCG ACG 

<MKARANGSLPLMVQRAPTKGADVEDQSVG V 
66791 GTX3GCCAACTCCGGGTCGGTGAAGACGTTCGCGGCCACCGTACGCAGCCGCAGCGGCCGGACCGCCTCGCCGAGCGCGTGCCACATCGCGAT 

<T ALEPDTFVNAAVTRLRLPRVAEGLAHWMA 



Figure HQ 



66883 CCGGCCCTGCATGGCGGCGACGCTGGCCAGCGGCA^ 

<RGQMAAVSALPLVGTCDGAAYIGPVNTRSVR 
66975 GGTCGACGGTGACGTAGCCGCCCCGGGCCAGCTCGACGCCGTACTCGGCGAGGCCGAGGTTGGCGGTGTTGGGGATCGAGCCGACCGCGATG 

< DVTVYGGRALEVGY EALGLNATN P I S G V A I 
67067 AGCGCGTGCGAGCCGTGCACCAGCCGGCCGTCGGCCAGTTCGACCTCCAC^ 

<L AHSGHVLRGDALEVEVGDAIRQVREARSNN 
67159 G AGGATCGTCATGCCCCGGGAGCGGAACACGCGC^CGATCGCCATGGCGGCGTCGGCGTCCTCGTGCGGCATCACCCGGTCCCGGCTGG 

<L ITMGRSRFVREI AMAADADEHPMVRDRS S V 
67251 C G AGGGTG ACCGGG AC C CCC ATGGC C AGGT AC G CG CTGG CG AACTCGGC AC CGGTG ACGC CGGAAC CG AC G ACG ATC AG G TG C TCGGG C AGG 

<LTVPVGMALYASAFEAGTVGSGVVI LHEPL 
67 34 3 TGCGGCAGGTCGTACACCTGXTCGCCAGGTCAGGATGCGCTCGCCGTCCGGCACGGCCGTGGGGAGCTGGCG<KXtGGTGGCGCCGGTGGCGAC 

<HPLDYVQRWTLIREGDPVATPLQRPTAGTAV 
674 3 5 CAGCACGGTCGACGCGTCGATCGAGTGCTTCTCGGAGCCGTCGGCCGGCGTGACGACGACGCGGTGGGTGTGGCCCAGCATGTCCTCGCCGA 

<LVTSADISHKESGDAPTVVVRHTHGLMDEGL 
67 527 GCCGGGCCGTGCCGGCCACGAAGGTGACGCCGGCTTTCACCAGCTTCGCGTGGATGTCGGCGGACTGGGCCAGGGCGAGC 

<RATGAVFTVGAKVLKAHIDASQALALRKVR 
67 619 TCGTGCACGGCCCGGGCGTCGACGGTGACCGCCTCC AGCCCGTCGGAGTGC ACCCCGAACTCCTCGGTGTCCCGGTACCCGGTGACC ACCTC 

<EHVARADVTVAELGDSHVGFEETDRYGTVVE 
67711 CGAGCTGGCGATGAACGTTTTCGACGGTACGCAGTCGGACAGCACGCAGGCACCGCCGGCCCCCTCGGCCTCCACCACGGTGACATCAGCGT 

<SSAI FTKS PVCDS LVCAGGAGEAEVVTVDAD 
67 8 0 3 CCAACTGGGCGGCGACCAGGGCCGCCTCGTACCCGGCCGGCCCCCCGCCGATGATCACGATCTGGCTCACAGCTATCGCCCCTCGTCCGT 

< LQAAVLAAEYGAPGGG I IVIQSV 

67 89 3 GCTC AC AGTGACTTTCTTCTCCCCGACGCGTCCG AC ACGCACCGTCGTATTCTCCCCCAGCCGTCCGCCGGGCTATCGTC ATCGCCGTGCG 

> V R 

67 984 TCACTACGCCGCCTACGGCTCAAACCTGGACCCCGCCCGGATGCGCGCCTACTGCCCGCACTCCCCGATGGTCGGCGTCGGCTGG 

>HYAAYGSNLDPARMRAYCPHSPMVGVGWLE 
68076 GCTGGCGGCTCACCTTCGCGGGTGAGGGCGCGATCGGCTGGGAGGGCGCGGTCAGCACCATCGTCGAGTCCCCCGGTGATCGGGTGTTCGTG 

>G WRLTFAGEGAIGWEGAVSTIVESPGDRVFV 
68168 GCGCTCTACGACATCCACCCGTACGACGCCGTCC AGCTCGACGAGATCGAGGGGGTGGCCTCCGGGACGTACCGCAAGCTGC ACGTCCGCGT 

>ALYD I HPYDAVQLDEI EGVASGTYRKLHVRV 
682 60 CTCCACCCTCGACGGCGACGTGACCGCGTGGGTCTACGTCTTCGACGGGTACGAGGGCGGCCTGCCGACGGCGTGGTATCTGTCGGAG ATCG 

> STLDGDVTAWVYVFDGYEGGLPTAWYLSE I 
6 83 52 CCAACGCCGCCGAGAAGGCGGGCGCGCCCG ACGACTACGTCAGCGAGCTGCGGTCCCGCCCCACCGGCACGGCGTCGGCGTAGCGCGTCTC 

>A NAAEKAGAPDDYVSELRSRPTGTASA • 
68443 CCACACTCCCAGTCTGCTCCGCCCGAGACGGGGCCGCACGGCGGCCCCCGCCGGGGGTCGTCTGTCACACATCATGGTCGCGCCCGTCACA 

< • V 

685 34 CCGCCGTGGCGGCGGGGACGGTGCGCTCGTACATGTCGGTCCAGCGCATCTCGCGCAGCCCCACCGAGCGGTAGAGCGTCGCCGGGGAGGTC 
<ATAAPVTREYMDTWRMERLGVSRYLTAPST 

6862 6 GGGTTGGTCAGGTCGACGCCGAGGCCGGCGTGCCGCCGTCCCTTCGCCGCGTAGACCGTGAAGGCCCGCCACAGCAGCGCGGCGCCGACCCC 
<PNTLDVGLGAHRRGKAAYVTFARWLLAAGVG 

68718 GTGCCGCCGGTACTTCGGCAGCACCGACAGGGTCCGCACCCAGCCCG AGTCCTGTTCCAGCGCCTGGTCGGACGACTGCAACGCGCCGGCCG 
<HRRYKPLVS LTRVWG SDQELAQDSSQLAGA P 

68810 GCTCCCCGTCGACCTCGGCGACGAACCACTCGTCCC AGGTCCTGTCGTACGCGGGCAGACGCTCCCGCCAGTGGTCGTACCCGGCCGGCTCG 

< EGDVEAVFWEDWTRDYAPLRERWHDYGAPE 
68902 TAGTCCGGGGTGTCCCGGAACGCCGTGTCGTAGATCCGGTGGAACAGGCGCAGGTCGTCCTCGTCGCCGGCGCGC AGCGGCCGGACCGTCAC 

<Y DPTDRFATDY I RHFLRL'DDEDGARLPRVTV 

68994 CCCGGGTGGGGGCGGCGGCTCGGCGGGCAGCCCGGCCAGGTCACG ACTCATCCGTACGTACCGCTTCACCCGGCTGAAACCGGCCTCGGTC A 
<GPPPPPEAPLL»ALDRSMRVYRKVR£FGAETL 
69086 GCTCCGTCACCCAGCGGGTCTCCGGCGGGTAGGCCGAGGCGCGGACGGTCAGCGCGGGCAGGCTCCGCTCCGCGGCCCGCTCGGCGACCCGG 

< ETVWRTEPPYASARVTLAPLSREAAREAVR 
6917 8 TCCAGCATCAGAGCGAGCAGCGGGGCGCGTACCGCCTCGGCGCGCTCCGGGTCGACGAGGACGTCGACGAACT^ 

<DLMLALLPARVAEAREPDVLVDVFERGVGTP 

6 927 0 GTTGTCCACCACCGACCAGG<:GACGAGCCGGCCCTGCGGGTCGCTGACC^GCCAGGAGTCGCGCGCCGGGTCGAAGAAGGGGGCCGTC AGGG 

<NDVVSWAVLRGQPDSVLWSDRAPDFFPATLA 

693 62 CGGCCTTGACGTCTTCGGCGTCGAAGT'CCGGGTGGCCG 

<AKVDEADFDPHG I AFTDAAHVVALIGPVDD 

694 54 AGGGTGGGGCGGCGCGCCGCCCAGTCAGCGGGAAGAGTCACGGGGCGATCCTGGCAGCC ACCCCGGTCCCGCGCCCCTCATTTTCAACCGC 

<L TPRRAAWDAPLTV 
69545 CCCGCGCCCTGCCCCCGCCGCCGCACGCGCTCGCCGGCGAGAGGGGACCCTTTCT^ 

<* YPARGQVV 

69637 GGCGCGTGCGGTGGTCAGCAGGTCGAGCAGCGCGGTGCCCTC 

<ARATTLLDLLATGEAPTLARFGTSVLRDTVA 
69729 CCTCCGACCACAGCCGGCGGCGCACCAGCGGCCCGACCCGGGGGTACGGCGGCGGCCAGCCGCAGGCC AGCGCCCCGGCCTCCCCCTCCGGC 

< ESWLRRRVLPGVRPYPPPWGCALAGAEGEP 
69821 CCGGCCAGCACCGCCTCCAGCGGCGTCATCCCGCCGGCCCGCACCGC^ 

<GALVAELPTMGGARVALLYAGAFHERLLLLG 
69913 GGCGGCG<XrACGGGCCCCCGGGGTGTCGTCCXX5CGGCGGCACGGCC 

<AAARAG PTDDP P PVARWAAFLPMG SADAA DV 

7 0005 CCACCCGGTCCAGCAGCGTCGCCAGACGGATCACGCCGGGCACGGCGGTCAGGTGCTCGCTGCCCCA 

<VRHLLTALRIVGPVATLHESGWRCCEALNA 
7 0097 GTGGCCACCTCCAGCGGAGCGTGC ACGCGGGCGGCGGCGTCCCACCCGTCGGCG ACCGCGTCGGGGGCGACGAAGCCGAGGGCCGCGCTCAC 

<T AVELPAHVRAAADWGDAVADPAVFGLAASV 
7 0189 CGTGGCGGCCCCGACGTCGCCGAGCACCCCGGCCCGGCCGGCGATGTGGAAGGCCCAGCCGGAGATGCCCAGCAGACGGGCCCGGTGCAGGG 

<TAAGV DG LVG ARGA I HFAWGS IGLLRARHLT 
7 0281 TGGCCGGGCAGCGGGCGAACATCTCCCCGAGCTCCAGCACCAACGGCTTGCTC 

<APCRAFMEGLELVLPKSAAAVQEPTM < • D 

70372 GCCCCGCCGGGGCCTTCCGTGACAGCCCCGTCTTCGGCGTCGAC^ 

<A GGPGETVAGDEADVAEIAGEIEGTRRQAAT 
70464 GACGGCCCGTTCCGCCGCCCGCCGGGCCAG(?TTCCGCCGGCTGAGTT^ 

<VAREAARRALKRRSLEQEAEARSRELEALSR 

7055 6 GCTCGATGCCGGTCAGTTCCTCGGCACCGTCGTGCTCGGCCTC 

< EIGTLEEAGDHEAEVAGALEAEAREQDTRA 



Figure 1 1 R 



7 0648 CGCGCCAGTTCCCGTTCCAGCATCC^ 

<RALERELMRRQRAREARAAREAERAARK«Rt- 

7 0740 tggtgggtgggtgggcggtggctgctot^ 

<PPHTPPPQEEGGTVLRLQPRPVEGFGmYS«^ 

70832 CCCGCAGCAGCCG<^CGGAGCGCACCTGCCCGGCCACCTCCG^^ 

<rllrgsrvqgavetdslaadltaeveglpl 

70924 TTCCCGGCCCXSCG^CCGCCCTCC^ 

<kgappggeadaalrraeavlaavaarrqasl 

71016 tcccgcagccc^gccgcgcaggtcgcgc^ 

<erlrpgrldrqarrlaealqtldav^eprr~ 

71108 gggcgagcaggttgaccagccacgcggccacggtcgggcggcgcagccgggcgatctcgcgggccgtc^ 

<ALLNVLWAAVTPRRLRAlERATAPDGSRR rt 
712 00 TCGGCGACGGCGGCGTCGCGGGTGGCGACGAACTTCTCCGGCGGCTCGGTGTAGAGCCGCCOT 

<eavaadrtavfkepfetylrrllsqppp\ 

712 91 agacgtcgagccggctgcccggctcgaggcgctggt agtcggtccccgac agcgcggcgtactgacggttgaggacggcg aggccgttgtcg 

<vdlrsgpelrqydtgslaayqrnlvalgnd 

713 83 TTGAGCAGCCCGTCGTGCAGGGCGAAGGCTCGGCGCGGGGCGACCGCGCGGATGAAGTCGAGCAC 

<NLLGDHLAFARRPAVARIFDLVESFKSVy,P A H 
7147 5 gatcggcgcgaagagggtgtccacgggggcgtcctcgggtgccacc agggcgtcgccggggtggtagacg ACGTCGTTC AGC AG^TACCCo a 

<IPAFLTDVPADEPAVLADGPHYVVDNLL\GL 
71567 GGTTGTCCACGACCGGGATGTCGGGATGGATGACGGCGTGCCGACCGCCGTACGCGCGCACCGCCACGCCCGCCGCGGTGAACGCCTGCCCG 

<ndvvpidphivahrggyarvavgaatfac g 
71659 ggtgagatgggctccagggcttcggccacgtcgccgagggcaccggccagcgaggccgggccgtagatgcggaacggcca^^ 

<PS ipelaeavdglagalsapgyirfpwrdlq 
71751 ccgggtgagagccgcgacgtccacgtcgtcggggtgctcatgggtgatcagcaccgcgtccgcaccgtccagcgcggtcgggtcgc^ 

<RTLAAVDVHDPHEHT ILVADAGDLATPDS F V 
7184 3 CGCCCGGGTCGACGACCAGCACACCGCCGTCGTGCTCGACGCGGAGGCAGGAGTG 

<GPDVVLVGGDHEVRLCSHAFKTLQM 




72025 CGGAGCGGGAATGGACGGACAGGGAA^ACUUA^i .wu^wtwv^ , ^ . — * — 

72117 G^ATG^CGGGAGAGCCTTCGGCGGCGGTGG 

> M R ^* R V 



7 2300 ACGATGTGGACGCCGCCGCCCGGTCCGCC ATC ACGGCGGTC ACCGGCCTCGCCGGCTTCGTCGGCCGCG ACGAGCGC AGC AGCGGCGG AACC 

- - — - **- T *GVGGFVGGDERS SGG1 



72392 > gcccLcgccc<m^^ 

iADARAELOLRVPAERFTAVLEELARLGRQEQ 
72484 GCGGGCGATCCGCACGGAGGACGTGACCGAGGAGACCGTGGATCTCGACGCGCGGATCGCCACCCAACGGGCCCGGGTCGAGAGCGGTCGCA 
>RAIRTEDVTEETVDLDARIATQRARVfcbL. k 

72576 AGCTGCTOGCGCGOXX»CCTC» 

»vt r ARATSIGDLVTLESEVARREADLASLtA 

72668 LGAAGCGCCGGCraCCC^CCTGACCra 

^KKRRLADLTSLSTITLTLVGPEAEARDTfcFU 

72760 CACCGGCCTCGTGGTCGGCCTGCGCGGCGGCTGGACGGCGTTCGTCGC 

>TC FVVGLrRGGWTAFVAS LGVLL-TVL.CjAl-.i-r 
72852 TCGCGGT<^CCCTCGGCGTGCCGGTGGCGGTGCTGCTTGCGGTGC^ 

>F AVALGVPVAVL LAV LR R R R RR P P A < . R A 

7 294 3 GCCGCCAGTGCCCGCAGCGCGGTCTGCACC ATGACCCGG ATGCCG ACCGCG ATGGCCCCCTCGTCGACGTCGAACG AGGCCCGGTGCAGGT 

<A P A P L \ P R \ \ % S Q ^ P M *V R I G V A I A G E D V D F S A R H L D 
73034 CGACGTTCGGGCCGGACCGGGCGACGCCGAGGCGGGCCAGCGCGC 

<VNPGSRGVGLRALAGPVYELYWSFDEGGM5> 

7 312 6 TGCC^TCTCCGCGACCCCCTCCGGGCCGAGCG 

<OPTEAVGEPGLAAHTAATLVQ I ARADNTVPP 

73218 CCGGCCGCGTAGGTACTCCAGGTCGACGGTGGCGCCGGTG 

<RGRLYELDVTAGTPAIVDRVVQAVIKPAQDW 

AGGTGTCGCGGTCCATGACCCGCAGGGTGCCGGACGCCX^ 
<TDRDMVRLTGSASAESPIVNYRTGASAH G F 

ACG AGC AGC AGC C CGCTGTTG<5CCGGC AC CCGGCGGCTG AC C AGGGCTGGC AC CTCG£TC AC C AGCCGG CCGAGCGCGTCG ACC AGGTCGAC 
<VLLLGSNAPVRRSVLAPVETVLRGLADVLDV 

73494 GGTCAGGTGCG<^CGAGCGX3TGTGCCCGCCGGGGCCGGTGAGCCGG^ 

<TLHPRATHGGPGTLRVTVNDAAATI PGVRLG 

7 358 6 CGACCTTGCCGACGGGCTGGTTGGGCTCGC AGTGC AGTGCGAAGATCTGC ACGACGTCGTCGAGACCGCCGG^ 

<VKGVPQNPDCHLAFIQVVDDLGGAEIVELS 

73678 ccgcagg<x:aggatctcctcggccggctggaagatcagccggacccggcc 

<C C P L I EEAPOFI LRVRGDLEGLNALQALLVG 

7 377 0 gacgccgagcagcacggtggtgtgcacgtcgtggccgcag<^ 

<VGLLVTTHVDHGCAHCVGDKTSRYPVDKvDT 

7 3862 TC AGCGGCAG CGCGTCG ATGTCGGCGCGG AGCGCG ACC AC C GGGC CGTCGGGGCGGC C GTCG ATGTCGC AG ATG AC C CCG TTGCCCTTTGGC 
<LPLADIDARLAVVPGDPRGDIDCIVGNGKP 

73954 AGCAGGCGCGGGCGCAACCCGGCGAGCGACAGCTGGCGGGCGATCAGGGCCGCCGTCT^ 

<LLRPRLGALSLERAILAATEFEEGSLEPHSH 

74046 GATGTGCCGGCGGGTGGCGATAAGGCCGGGCATCCGGAG<X;C 

<IHRRTAI LGPMRLALLHDLEFPLPQSGSPSE 
7413 8 C CGGCC AGGC CG ACG ACG C C AGGTGGCTGCCGTTCGGC AGCGTC AACGC ACTCGTC ACGTCG AAATC TCG ATC ACT AGAAACGG ATGG ATC 

74229\t^AG^GATGACAG?cGCCAGCCTAGACC^ 

Figure US 



< R G 
73310 AGGTG' 



73402 



7 4 68 9 CTCATCCGAGACAAGGGGTCAGAACCGGTCGCTGGGGCGGTACGTCCCCCACACCTGGCGCAGGGTGCCACAGACCTCGCCCACCGTGGCC 

<• FRDS PRYTGWVQRLTGCVEGVTA 
74780 CGGGCCCGCAGCGCCTCCTTCATCGGGTGCAGCACGTTCGCCGTACCCTCGGCGGCGGCCCGCAGCTCACCCAGCGCGCGCTCGACGGCGTC 

<RARLAEKMPHLVNATGEAAARLEGLAREVAD 
74872 GCTGTCGCGCTCCACTCGCAGCTTGGCCAGCCGCTCGGCCTGGGCAGCCTCGATCGTCGGGTCCACCCGCAGCGGCTCGTA^ 

< S d R EVRLKALREAQAAEITPDVRLPEYPEDA 
74964 CGTCGACCGTGAACCGGTTGAGGCCGACCACCACCCGCTCGCCCGAGTCGATCTCCTGGGCGATCCGGTACGCGGACTGCTCGATCT 

<DVTFRNLGVVVREGSDIEQAIRYASQEIER 

75056 TTCTGGAAGCCCGCCTCGATGGCGTCGACCACCGA 

<K n GAEIADVVSGHDAVREMLEVVAAEIEAT 
75148 CATCGCCTCCACCACGTACGACCCGGCGAACGGGTCGACGGTGGCGGTCAGGTCCGTCTCGTACGCGAGCACCTGCTGGGTGCGCAGCGCCA 

<MAEVVYSGAFPDVTATLDTEYALVQQT^RLAL 
75240 GCCGGGCGGCCTTCTCGGTGGGCAGCGCGATGGCCTCGTCGAAGCTGTTCGTGTGTAGCGACTGGGTGCCGCCGAGCACCijCGCCCAGCCCC 

<RAAKETPLAIAEDFSNTHLSQTGGLVAGLG 
7 53 32 TGGATCGCCACCCGGACCAGGTTCACCTCGG<3CTGCTGGGCGGTGAGCTGCACGCCCGC 

<OIAVRVLNVEPQQATLQVGATQTHFRLMMSK 
7 5424 CGGGTTCTTCGCGCCGAACTCGTCGCGC ATCAGCCGGGCCCAGATCCGCCGGGCGGCACGGAACTTCGCGACCTCCTCC AGC AGGGTGGTCC 

<PNKAGFEDRMLRAWI RRAARFKAVEELLTTR 
75516 GGGCGACGAAGAAGAACGACAGCCGGGGCGCGAAGTCGTCCACCGCCAGCCCGGCGGCGAGCGCGGCCCGGACGTACTCGACGCCGTTGGCC 

<AVFFFSLRPAFDDVALGAALAARVYEVGNA 
75608 AGCGTGAACGCGATCTCCTGCGCGGGCGACGCGCCGGCCTCGGCCATGTGGTAGCCGGAGATGGAGATGGTGTTCCACTTCGGCACCTCCGC 

<LTFAI EQAPSAGAEAMHYGS I S I TNWK PVEA 
7 5700 CCGGCAGTAGGCGAACGTGTCGGCGACCAGCCGCAGCGAGG^CTTCGXSCGGGAAGATGTACGTGCCCCGGGCGATGTACTCCTTGAGGATGT 

<RCYAFTDAVLRLSPKPPFIYTGRAIYEKLID 
7 5792 CGTTCTGGATGGTGCCGTTGAGCGCCGCGCCCGGCACCCCCGCCTCCTCGGCGACGAGCTGGTAGAGCAGCAGCAGCACCGACCCCGGGGCG 

<NOITGNLAAGPVGAEEAVLQYLLLLVSGPA 
7 5884 TTGATCGTCATCGAGGTGGAAACCTTGTCCAACGGGATGCCGTGGAACAGCAGCCGCATGTCCTCGATCGAGTCGATGGCGACGCCGACCTT 

<NITMSTSVKDLP IGHFLLRMDEI SDIAVGVK 
7 5976 GCCGACCTCGCCGTGGGCGATCGGGTCGTCCGAGTCGTACCCCATCTGGGTGGGCAGATC AAAAGCG ACGGAG AGGCCC ATGGTGCCGGCGC 
<GVEGHAI PDDSDYGMQTPLDFAVS LGMTGAR 
7 6068 GCAGGAGCTGGTGGTAGCGCGCGTTGCTCTCCGTCGCGGTGCCG AAGCCGGCGTACTGGCGC ATCGTCCACGGCCGCGAGGTGTAC ATGGTG 

<LLQHYRANS ETATGFGAYQRMTWPRSTYMT 
7 6160 GAGTAGACCCCACGGGTGTACGGG AACTCCCCCGGCTCGCCCAGCCGCTCGGGC AG ACCCTCCGGCAGGTCCCTCTGGGTGT AGAC ACCCTT 

<S YVGRTYPFEGP EGL.R EPLG EPLDRQTYVG K 
7 6252 GATCGGGAAGCCGGACTCGCTCGACCGCGGTTCACT^^ 

76343%GCACCCcWrWpTC^^ 

76435 ?^cg^aaccagcaagatagaggagttgtgtcccagccccctcgatttcccccggtg^ 

76527 GCG^GGACCAGTC^GCCCACC^ 

76619 GGCATGGGCAC^TCTGGCGTGCCACAGACACCCTGC^ 

>MGTVWRATDTLLRR DVAVK EVV L. P PG LA P 



7 6710 GCGACCGCGACGCCATGTACGAACGCACGCTGCGCGAGGCCCGCGCG 

>SDRDAMYERTLREARAAAAIQHPAVVQVYDV 

7 6802 GTCACCGAGGGTGGTCGCCCCTGG ATCGTGATGG AGCTGCTGGACGCCCGCAGCCTGGCCGAC ATGGTG ATCG AGG ACGGGCCGGTGGCCCC 
>VTEGGRPWIVMELLDARSLADMVI EDGPV A P 

76894 CCGCGCGGTCGCCAAGATCGGCATCGCCCTGCTCGGCGCGCTGGAGGTGXSCCCACGCGATCGGGGTGCTGCACCGCGACGTGAAGCCGGCCA 
>RAVAKIGIAL r , GALEVAHAIGVLHRDVKPA 

7 6986 ACGTGCTGATCTGCACCGACGGCCGGTGCGTGCTGACCGACTTCGGGXS^ 

>NVLICTDGRCVLTDFGVAKLPTDVQLTTPGM 
77078 GTGCTCGX3CTC GCCGC ACTTC ATCTCCCCC G AGCGGGC C ATGGCC C AG<LAGTTCGG C CCG CC G AGCG AC CTG*TTCTC CCTCGG CGTC ACGCT 

>VLGSPHFISPERAMGQEFGPPSDLFSLGVTL 
7717 0 CTACACGGCGGTGGAGGGCCGGCCCCCGTTCGAC AGGGGCG ACCCG ATCG AGACCATGC ACGCCGTGGTCG AGG ACCCGCCGGCC ACGCCGC 

>YTAVEGRPPFDRGDPI ETMHAVVEDPPA T P 
77262 AGCGCAGCGX^CCCGCTGACCCGGGTOCTGATGGGGCTGCTGGAGAAGGAC 

>QRSGPLTRVLMGLLEKDPARRLDVHTARA M L 

77354 CGCGAGCT<X;TCGCCGGCCCGCTGACCAGCACCGCCACCGCCGTCAACTCG<rrCACCGACCCGTA 

>RELLAG PLTSTATAVNSVTDPYAVVPVKQ R P 

77446 G^CGTCGCCCCACCGCCCTCCGCTCCGGAGCCGAAG^^ 

>AVAPPPSAAEPKPSGQIGGRAMLAPGESL TT 

7753 8 ACCGX3CTGGCGCCCCTC^GCCGGGGCGAGAAGACGAGGAAGAGGAAGACGACGACCGCCGCCGC 

>DRLAALRRGEKTRKRKTTTAAALDDTSADAL 

7763 0 GCCGGCCCGCTGCAC ACCCCCACCG<^CGATGCCCCKXX:CCCCGCCCGCCGG^ 

>AGPLHTPTGAMPAPPPAGRTYGGSSEAT Q R V 
77722 CGACGCGGGGACCGCGCCGGAGGCCACCCAC^GGATGACGTACXK5CAG<^CGCCC^ACGCCACCCAGCGG^ 

>DAGTAPEATQRMTYGSP PDATQRVSHGSG P 

77814 CGGAGSCC ACCC AGCGGGTGCCCTA03GX:G^ 

>SEATQRVPYGGGSADATQQVPFGRRPDATQR 

77906 GTCCCCTACGGCAGCCAGCCCGX^GCGA^ 

> V P Y G S Q PGATQPV PG FGASPDATQRVGGAYG 

77998 cggcggccagt<^tccx;tgcccggcaccggccagccgtgg^^ 

>ggqwsvpgtgqpwatpatapapataggggv 

78090 GCCGC CTCGTCGC CACG^TCAAGGGCTGGCCGCG C AAGGT<KI AGXTTCGCCGCGGCCGGCGGCGTCGCCGTCCTGCTG CTG ATCGGCGTGTTC 
>GRLVATVKGWPRKVQLAAAGGVAVLLLIGVF 

78182 GCCCTCTTCGGCGGCGACGACCCGGAGCAGCCGACCACCCCGCAGGGGCAGCC 

>ALFGGDDPEQPTTPQGQPSAGAPAG PGVEM Q 

7 8274 GGAGCAGTCGGCG^GGGCGTCACGGTCCAGGra 

>EQSAKGVTVQVPKGWERRSADGGVWVDY ID 

78366 CGGAGGACAACAGCCGCAAGGTGCGCATCCTCGCCGAGCGGTOT^ 

>PEDNSRKVRILAERWSGTSTRWAETAANGL R 

7 8458 ACCCGGTCGGCCTCCTGCCAGAAGCCGTACAACCAGGTCTCGATGACCGAGCAGGAG^^ 

>TRSASCQKPYNQVSMTEQELDGKAAAEFEYT 

7 8550 CTGCGGCGACGGCGAGGGCAAGCGGCACGGCGTGTGGCGC 
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>CGDGEGKRHGVWRGVVHEGKVYS FY LSSTD 
7 8 642 CCCGCTTCGCCGAGAGCAAGCCGATCTTCGATCAGATGGTGGCGTCGTTCAAGCTCCGCGGGAGCGACTGAGCCGGGCCGGGGGCCGACGC 

>ARFAESKP I FDQMVASFKLRGSD 
7 8733 GACGCCGGCCGGGCGGGCCGACGCGACGTGGTGAGCCGCCGCCGTGCTATCAAGAGCCATGGCGGCGGACACCACTGACCTCGACGAC ACG 

>MAADTTDLDDT 
7 8824 CGCGATCTGGACGACCTTCGCGACCGGGCCCGGCGGTGGCTCGACGACGACCCCGACCCGGCCACCCGCGACGAGCTGG AGGCCGTGCTCGA 
>RDLDDLRDRARRWLDDDPDPATRDELEAVLD 
7 8916 CGGGCTGCCGGCGAGCGCGGCCGAGCTGGCCGACCGGTTCGCCGGGCC ACTGACCTTCGGG ACCGCCGGGCTGCGCGGCCCGCTGCGCGCCG 

>GLPASAAELADRFAGPLTFGTAGLRGPLRA 
79008 GCCCCAACGGGATGAACCTCGCCGTG<3TCACCCAGGCCGCCGCCGGGCTCGTCGCCTGGCTCGCCGCCCAGGACGCCACCGGGCCGCTGGTC 

>GPNGMNLAVVTQAAAGLVAWLAAQDATGPLV 
79100 ATCGGGTACGACGCCCGGCACGGCTCGCGGGAGTTCGCCGAGCGCACCGCCCAGGTGGCCACCGGCGCGGGCCGCCCGGCGCTGCTGCTGCC 
>IGYDARHGSREFAERTAQVATGAGRPALLLP 
7 9192 CCGCCCGCTGCCCACCCCCGTGCTGGCGTACGCGGTGCGGCAGCTCGACGCGGCGGCCGGCGTGATGGTGACCGCCAGCCACAACCCGCCCC 

>RPLPTPVLAYAVRQLDAAAGVMVTASHNPP 
7 92 84 AGGACAACGGCTACAAGGTCTACCTCGGCGCGC AGCTCGGCGGCGAGCTGGGCGCGGGGGCGCAGATCGTGCCGCCGGCCG AC ACCGGC ATC 

>QDNGYKVYLGACLGGELGAGAQIVPPADTGI 
7 9 376 GAGGCCGCCATCCGGGCGGTCGGCCCGCTGGCCGACGTACCGCTGGGCCCGGCCGGGCAGGTCGTCGGCGACGACGTGGTCGTGTCGTACGT 
>EAAI RAVG PLADVPLGPAGQVVGDDVVVSYV 
7 94 68 CGACCGGGCCGCCGCCGTGGTCGACCCGGCGGGGCCCCGGAGCCTG AAGGTGGCCTACACGCCGCTGC ACGGCGTGGGCGCGGCCGTGCTG A 

>DRAAAVVDPAGPRSLKVAYTPLHGVGAAVL 
7 9560 CCGCCGCCTTCGCCCGCGCCGGCTTCGGCATCCCCGGCGTGGTGCCCGAGCAGGCGGTGCCGGACCCGGACTTCCGGACCGTCAGCTTCCCC 

>T AAFARAGFG I PGVVPEQAVPDPDFRTVSFP 
79652 AACCCGGAGGAGCCGGGGGCGGTGGACCTCCTCGTCGCGCTCGCCGAGCGCACCGGGGCGGACCTGGCGATCGCCAACGACCCCGACGCGGA 
>NPEEPGAVDLLVALAERTGADLAIANDPDAD 
79744 CCGCTGCGCGGTGGCCGTCCGCGACGGCCGGGCGGCGGGCCCGGC ACCGGTGAGTGGGGGCGCCTGGCGG ATGCTGCGCGGGG ACG AGGTGG 

>RCAVAVRDGRAAGPAPVSGGAWRMLRGDEV 
7 9 83 6 GGGCGCTGCTCGCCGACCATCTCATGCGCCGTGGCGTCCACGGCCTGTACGCCACC ACCATCGTGTCGTCGTCCCTGCT ACGGGCC ATGTGC 

>GALLADHLMRRGVHGLYATTIVSSSLLRAM^C 
7 9 9 2 8 GCCGCCCGTGGCCTGCCGTACGACGAGACGCTGACCGGCTTCAAGTGGATCGTCCGGGCCGGCGGCGGACCGCTGGGTGAGGCCGGCTC^GA 
>AARGLPYDETLTGFKW I VRAGGG PLGEAGSD 
80020 CCCGCTGGTCTTCGGCTACGAGGAGGCGCTGGGCTACTGCGTCGCCCCGGAGCACGTCCGCGACAAGGACGGCATCACCGCCGCGCTGACCG 

> PLVFGYEEALGYCVA PEKVRDKDG I T A A L T 
80112 TCGCCGAGCTGGCCGCCGGGCTGAAGGCGCAGGGGCGCACCCTC ACCGACCGGCTCG ACG AGCTGGCCGCCGAGTTCGGCGTGCACC AC ACC 

>VAELAAGLKAQGRTLTDRLDELAAEFGVHHT 
80204 GACCAACTCTCGGTGCGGGTGGACGACCTGCGCATCATCGCCGACGCGATGGCCCGGGTCCGGGCGGCCACCCCGACGACCCTGCTCGGCCG 
>DQLSVRVDDLRI IADAMARVRAATPTTLLGK 
80296 CCCGGTGACCGAGGCGCG<^ACCTGC^CCCCGAGGCGGACGTXX5TGATCCTGCGTACCGACGGGGCACGGGTC^TGATCCGCCCGTCGGGCA 

>PVTEARDLLPEADVV I LRTDGARVVI RPSG 
80388 CCGAGCCGAAGCTCAAGGCGTACCTGGAGGTCGTGGAGCCGGTGGCGGACGGCGACGTGCCGGCCXXTCCG^ 

>T EPKLKAY LEVVE PVADGDVPAARTRAAATL 
80480 GCGGCACTCCGCACGGAAATCGCCGCCCTGGTGCAGGGATGAGGTGTGCTCCCGCTTCCGACGCTCTCTCAGCGG 

>AALRTEIAALVQG 
80571 CAGGTTCGTGGTGATATGCGGCCAGGTAAGCCACCGTCCTGCCACTATCCATGTCGTAGAACATGATGCGTGCCACTTGGATGTAGTAGTAG 

806 6 3 GTGACC AATG AGGTG AACCCGGGGCCCTGTTTG AGCGTGT ACGT AGCGCCG AAGCC ACTGGC AGTC AC ATCGTC ACGGCGGGCTTTCCCGTT 

807 55 AGCGCTCGCGTAGCTCAGCGCAGCAC^AACATCCTCCCCGCCATAGGGATACAGCAGCT<3GCTGGTGTATCCCTTCTCGAAGGGCTTTGCAG 

80847 GTTCTGCCGGCAAGGGCGAGAGGTCCTGGGCCTCCGTCTTGATCA 

80939 ATGACCGOCACCGGCAAGGTGTTTGGGAGCGTGACCGAAATGGACGTGAACCCGGGTCCGGGGCGCTT^ 
81031 GGCC^CCCGTCCGAGGAATAACTACGGAAC^ 

8112 3 TCACCTCGAAGGGCGCTACGGACGACCATCCAGCGCCCCAGTTGTCCTTCTGGCCATC AGGCGGCCAC ACCTGCTCGCAACCGTCTTCGTTG 
81215 TCCACAACCTGCCACACGGCGCTGTTGCCGGTAAGGGCATCGTAGAGCTGGAC ATACCGCCTGTCGTTCTGCACGACGGGAACGACGG AGGT 
81307 CCACCCCTTCCCGAAATTGCGCTGCCAGCGGGGACtGCCACTGGGGTCtGCATCTCGTCGT^ 
81399 CATTGCTGTTGTAGAACAGCACCGCTGG^^ 

81491 TCATAGAGAGCATTCCGGAGTGACATGGGTTTACCCCGTCCTAGGGATCAC^^ 

81583 CAATCGGGTGGAAGAAGAGCCAGCACCGGGCCTC ACTGGCCATCACAGTATCGTCTGACCTCTTCGAC^ 
81675 AGCGTTCCGCGCATCTGTCGGTCGGGCGTGTCGAAGGTGCGGCTCTTCGCCACGTGCGCGGCGAGCAG 
81767 TCTGCCACGAGGCGGCCAGGACCCAGACGGTG ACC AGCCGGCCC AGOG TXGCCGGTXIACGATGAA 

81859 CGGC GGCCCTCCGCGCCGG ACCGGG AACGCGG C ACGCTC AC CCCGCGTGATCC CTTGC CC ACG AAGGGCTC GTCTG AAT AGC ATC AGC AC AT 
81951 CAGGCCCCACCATTACCCGGCACCATTGGCATTCGCTTTCTT^ 

82043 TGCACGGTGCGTG AAACAAATGGCTCG ACCGTCCCGCGACGGCAGCTCC^C AC^CTGCTC^ 

>VPRRQLGRLLTQLRESAHI S 

82134 TCGACGCGGCC^CCGGCGAGCTGGACTCCTCGCGGCAGAAGCTCTGGCC^ 

>IDAAAGELDCSRQKLWRIERGLTSAKTPDVR 

82226 GTGCTCT<X:GAGCT<n*CCGGGCCACGCCCGAC^^ 

>VLCELYRATPDQASVLLGLAEVSRAEGWWHA 

82318 CCACGGCAGCTCCGTGCCG^CT<X3TTKrrCGCTCT 

>HGSSVPAWFSLYVGLENVASS I RHYNAELV 

82410 CGGGGCTGTTGCAGACCCCCGGCTACGCCACCGCGCTCTTCGAGCAC 

>PGLLQTPGYATALFEHNRPELGEEERKKAVG 

82502 TTCCGGACTCAGCGGCAG<^CT<5CTGGCCCGGCC^CTCCCCCCGGCCCCCGAGCTC 

>FRTQRQGLLARRLPPAPELTVI LSEAVLRR P 

82 594 GGTGCCGGGCCGATCGGTGATGGCCGACCAGCTCCGGCACCTGCT^ 

>VPGRSVMADQLRHLLAVGERHNITVRVLPL 

82 6 8 6 CCGCCC<;GK:CGCCC^T<X;CCGCCGAGGCCGC^ACGTTC^ 

>AAGPPLAAEAGTFVLLDFPLSALGSPTEP P T 

82778 GTCTACGTCGACX^CTCACCGGCGCGCTCTACCTCGACC^^ 

>VYVEGLTGALYLDQPTEIAAYERVWRGLDSL 

82870 CC<:CCTCGGCGCGCGACAATCAGCG<^GCTGATCGAT13CCATCC»3^ 

>ALGARQSAELIDAIRGECYE» 

82961 C ACCCGC AGCGGCACC AACGGCGGGGACTGCGTCG AGGTGGCCGAC AACCTC ACCGGC ATCGTCGGCGTCCGGG AC AGC AAGG ACCCGGGC o 

83053 GGCCC^CCCTGACCGTCCCGCCCGCCGCCTGGTCCGCCTTCGT^ACCGAGGTCAAGGCCA^ 

83145 CCAACCCCCGGCTACGCCTCAGACGCG&CCAGCGCCCCAC^^^ 
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83237 AGCCCGGGCCGCAGGGGCGCGTGCGCCTCAGACGGCCCGGGCCTCAGGCGCGCTTGCCGAGGGCGGCGTCGACCGCCTTGCCCAGGGCGGT 

< . ARKGLAADVAKGLAT 
83328 GACGACCAGGGCCACCGAGGGGCGGACCACCGAGTCGTCCAGGCTGACGGTGCCGGAGAAGCCGGCCCCGCCCGCGATCTCCTCCAGCCGAC 

<VVLAVS pRVVSDDLSVTGSFGAGGAl EELRR 
83420 GCCGGGCGTCGACGGCGGCGTCGCCGGTCAGGCCG AGCGCCGGCTCCATCCGGAGCACCGCGACCAGGGTGGCC AGCGCCGCCGTCGCCTCG 

<RADVAADGTLGLAPEMRLVAVLTALAATAE 
83 512 TCGGGGATCTGCCCGCCGGCCAGCGCCTCGGCCAGCCGCCGCCGGGAGTCCTCCTCCACGGTGGCGTCCGCCGTCGGGTAGCGGTGCACGTG 

<D PI QGGALAEALRRRSDEEVTADATPYRHVH 
83 604 GATGAAGCCCAGCTCGGTCTCCTCGACGTCGCGCACCACGCCCCGGGCGACCAGGTCGCCGAGGATCCGGTCGCGCAGGCCGTGGCGCAGCC 

<IFGLETEEVDRVVGRAVLDGLIRDRLGHRLR 
83 69 6 GCTGCACCCAGGAGGACGGGGTGTGCGGCGTGTCGGCGGCCATCCGGCCCAGGACCTCGTCCAGGATCGGTTCGCCGGTGGGCGCGGGGTCC 

<QVWSS PTHPTDAAMRGLVEDLI PEGTPAPD 
837 88 GTGACCACCAGGTTCCCATCGACGTACGCGACCCGGCCGGCGAGGGCCAGCTCGATCAGGACGGCGGCGGCCATCCCGAGGTCGAGGCTGAT 

<T VVLNGDVYAVRGALALEILVAAAMGLDLS I 

83 880 CCGCGGCATGGTCG^CTTGCCGGATTCGTCGTCGTACGCGAGGA 

8 3 9 7 0 < TAGCGCCTGAGCGCACCCCGTGCGCCCCCAACTCGCCCACGAGCGC ACTCGCCCGGTGAGAGGGGAACCCCGCTATACCGCAGGCGTTAACA 
84062 GGGGGCCCTTCCTTGCGATCAGAAGCGGGGCATACCGCCGAACTGGCGGTCGCCGGCGTCGCCGAGGCCCGGCACGATGAACATCCGGTCG 

<. FRPMGGFQRDGADGLGPVI FMRD 
84153 TTGAGGCTCTCGTCGATCGCGGCGGTGACGAGGCGCAGCGGCAGGCCGGACTGCTCCAGCCGGGCGATGCCG ACGGGCGCGGCGAGC ACGC A 

<N LSEDIAATVLRLPLGSQELRAIGVPAALVC 
8424 5 GAGCACGGTGATGTCGGTGCAGCCCCGCTCGGCC AGCAGCCGGCAGCAGTGCTCCAGGGAGCCGCCGGTGGCC AGC ATCGGGTCGAGG ACC A 

<LVTIDTCGREALLRCCHELSGGTALMPDLVL 
84337 GCACGGGCAGGCCGGCGAGGTCGCGCGGCAGCGACTCCATGTACGCGCGCGGCTCGTACGTCTCCTCGTCGCGGGCGAGGCCGACGAAGCCC 

<VPLGALDRPLSEMYARPEYTEEDRALGVFG 
84429 ATGGACGACTCGGGGAGCAGGGCGAGCGCGGCGTCGGCC ATGCCG AGACCGGCCCGCAGCACGGGTACC AGCAGCGGCGGGTTGGCCAGCCG 

<M SSEPLiLALAADAMGLGARLVPVLLPPNALR 
84521 GGTGCCCTCGGCGTCGGTGACGGGGGTCTGC ACCGGGTACTTCTCG ACGGGG AAGGAGCGGGCGGCCTCGTAC ACG AGC ATGGTGGTG AGCT 

<TGEADTVPTQVPYKEVPFSRAAEYVLMTTLE 

84 613 CGTGC AGCGCGGCCCGGAAGTTGGAGGAGTCGGTGCGCGCGTCCCGCATGGCGGTC AGCCGCGACTGGGCG AGCGG ATGGTC AATGACGTGT 

<HLAARFNSSDTRADRM 
84705 ACGTCCACGATCGCCCAACCTACCGAACGCCCGGGC CCCG AG AC ACG AGGAGCGACGCCGCTC AC CCGGCCCGTGCGGC AACCCTGGACGGT 
84797 TGGCGTTCGAAGGTAGGGCAGGTGCCCAAGGTCGGGCCCGACCTCGCCGCGTGATCAAGATCACGAGGCGTGCGGGTGCGTAGACTTCCGGG 
84889 CATGACGGCGACAGCGACGTCGGCCCGGTCGGACCTCTCCGAGCTGGGACGATCCGAGACCGCTTTGCGGAACTTCCTGCACGGCCTGCCG 

>MTATATSARS DLSELGRS ETALRNFLHGLP 
84980 GGCGTGGACCAGGTCGGCGCGGAGCAGCGGGCGGCCCAGCTCGGCACCCGCTCGATCAAGACCACGGCCAAGGCCCGGGCGATCGACCTGGC 

>GVDQVGAEQRAAQLGTRS I KTTAKARAI DLA 
85072 GATCCGGATGGTCGACCTGACCACCCTGGAGGGGGCCGACACCCCCGGCAAGGTGCGGGCGCTCGCGGCCAAAGCACTGCGCCCCGACCCGG 

> IRMVDLTTLEGADTPGKVRALAAKALRPDP 
85164 CCGACCCGTCCTGCCCGCACGTCGGCGCAGTCTGCGTCTACCCGGCGATGGTCCCGTACGTGGCCGAGGTGCTGCGCGGATCCGCCGGGTCC 

>ADPSCPHVGAVCVYPAMVPYVAEVLRGSAGS 

85256 GGGCGGCCGTCCG<^G^ACCG<3ACGGCAACG^^ 

>GRPSGGPDGNAPAGPGVVHLASVATAFPSGQ 

8534 8 GGCACCCCTGGAGGTCAAGCTCGCCGACACCCGGGCCGCAGTGGCGGCTGGCGCGGACGAGATCG ACATGGTGATC AACCGGGGCGCGTTCC 

>APLEVKLADTRAAVAAGADEIDMVINRGAF 
8 5440 TGGCCGGCCGCTACCGCGAGGTCTACGACGAGATCGTGGCCACCAAACAGGCGTGCGGGGACGCCCACCTCAAGGTGATCCTGGAAACCGGC 

>LAGRYREVYDEIVATKQACGDAHLKVILETG 
85532 GAGCTGGCCACGTACGACAACGTGCGCCGCGCGTCCTX3GCTGGCCATGCTGGCCGGCGGCGACTTCATCAAGACCTCGACGGGCAAGGTTCC 

>ELATYDNVRRASWLAMLAGGDF I KTSTGKV P 
85624 CGTCGCGGCGACCCTCCCGGTGACGCTGGTGATGCTGGAGGCGGTCCGCGACTTCCGCGCCGCCACCGGGCGGCAGGTCGGCGTGAAGCCGG 

>VAATLPVTLVMLEAVRDFRAATGRQVGVKP 
85716 CCGGCGGCATCAAGAACACCAAGGACGCGATCAAGTACCTGGTTATGGTCAACGAGACCGTCGGCCCGGACTGGCTGGACCCGGACTG 

>A GGI KNTKDA I KYLVMVNETVGPDWLDPDWF 
85808 CGGTTCGGCGCGTCC AGC CTGCTC AACG AC CTGCTCATGC AGCGC AC C AAG CTG ACG AC C GGCGTCTAC TC CGGTC C C G ACT ACTTC AC CCT 

>RFGAS S LLNDL-LMQRTK L.TTGVYSG PDY FT L 
85900 GGACTGAGCGTGATCTTCGAATACGCGCCCGCCCCCGAGTCCCGCTCGGTGGTGGACCTC AAGCCCTCGTACGGGCTGTTCGTCG ACGG 

>D« >VI FEYAPAPESRSVVDLKPSYGLFVDG 
8 5989 GGAGTTCGTCGACCCGGCCGACGGCGGCGGCTTCAAGTCGGTCAACCCCGCCTCCGAGGAGGTGCTCG 

>EFVDPADGGGFKSVNPASEEVLAEIAEAG£ 
8 6081 CCGACGTGGACCGGGCGGTCCGCGCCGCCCGGACGGCGTACGAGAAGGTGTG<XKXrCCGATGCCGGGCCGGGACCC^GGCCAAG 

>ADVDRAVRAARTAYEKVWGPMPGRDRAKYLF 
8617 3 CGGATCGCCCGGATCATCCAGGAGCGCTCCCGCGAGCTGGCCGTGCTGGAGTCCCTK^A 

>RIARI IQERSRELAVLESLDNGKPIRESRDV 
86265 CGACCTGCCGCTG<rrcGCCGCGCACTTCTTCTACTAC^ 

>DLPLVAAHFFYYAGWADKLPYAGFGPNPRP 

86357 TCGGCGTGGCCGCGCAGGTCATCCCGTGG AACTTCCCGCTGCTC 

>L GVAAQVI PWNFPLLMLAWKIAPALAAGNTV 
86449 GTGCTCAAGCCGGCGGAGACCACCCCGCTGACCGCGCTGCT^ 

>VLKPAETTPLTALLFAE ICQQAELPAGVVN I 
86541 CGTCACCGGCGCG^CGACACCGGCCGGGCGCTGGTCGAGCACCCGGGCGTGGACAAGGTCGCGTTCACCGGC 

>VTGAGDTGRALVEHPGVDKVAFTGSTEVGK 
8663 3 CCATCGCCCGGTCCGTCGCGGGCACCGGCAAG AAGGTC ACCCTGGAGCTGGGCGGCAAGGCCGCGAAC ATCGTCTTCGACG ACGCCCCGGTC 

>A IARSVAGTGKKVTLELGGKAAN IVFDDAPV 
86725 GACCAGGCGGTCGAGGGGATCGTCAACGGCATCTTCTTCAACCAGGGGCACGTCTGCTGCG^ 

> D Q A V E G IVNG I FFNQGHVCCAGSRLLVQESV 
86817 CGCCGAGCAGGTGCTGGAGTCGCTGAAGCGCCGAATGGCGCTGCTGCGGGTCGGCGACCCGTTGGACAAGAACACC 

>AEQVLES LKRRMALLRVGDPLDKNTDIGA I 
86909 ACTCGGCCGCCCAGCTCGCCCGCATCCGCGAGCTTGTCCGCCGCGG<jCGAGGCCGAGGGGGCGGAGCGCTGGTCGCCGCCGTGCGAGCTGCCC 

>N SAAQLARIRELSAAGEAEGAERWSPPCELP 
87001 GAGCGCGGGTTCTGGTTCGCGCCG ACGATCTTC ACGGGGGTC ACCCAG^GCACCXSGATCGCCCGGGAGGAGATCTTCGGTCCGGTGCTGTC 

>ERGFWFAPTI FTGVTQAHRIAREEI FGPVLS 
87093 CGTGCTGACCTTCCGCACCCCGGCCGAGGCCGTCGAGAAGGCCAACAACACGCCGTACGGGCTGT^ 
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>VLTFRTPAEAVEKANNTPYGLSAGIWTDKG 
87185 CCCGGATCCTGTGGATGGCCGACCGGCTGCGCGCCGGGGTGGTGTGGGCCAACACGTTCAACAAGTTCGACCCGACCTCGCCGTTCGGCGGO 

>SR I LWMADRLRAGVVWANTFNKFDPTS P F G G 
87277 TACAAGX5AGTCGGGCTACGGTCGCGAGGGCG<3CCGGCACGGGCTGGAG^GGTACCTCGGTGTCTGAGCG<;GTCGCG^ 

>YKESGYGREGGRHGLEGYLGV • 
87368 AAGCTCTTC ATCGGCGGGAAGTTCCCGCGCAGCGAGTCGGGACGGTCGTATCTCGTGCAATCCGCGAACGTGTCGCTGGCCTC CCGCAAG 

>VQSANVSLASRK 
874 58 GACGCGCGGGACGCCGTGGTCGCCGCCCGCGCCGCCGTGAAGGGCTGGGCCGGGGCGACCGCGTACAACCGGGGTC AGATCCTCTACCGGGT 

>DARDAVVAARAAVKGWAGATAYNRGQ I LYRV 
87550 CGGCGAGATGCTGGAGGGCCGCCGCGAGCAGTTCGTCGCGCTCGGCGTGCCGGCCGACGAGGTCGACGCGGCGATCGACCGCTGGGTCTGGT 

>AEMLEGRREQFVALGVPADEVDAAIDRWVW 
87 64 2 ACGCGGGCTGGTCCGACAAGCTCCCCCAGGTGTACGGCGGTGCGAACCCTGTCGCCGGGCCGTACTTCAACCTGTCCGCGCCCGAGCCGACG 

>YAGWSDKLPQVYGGANPVAGPYFNLSAPEPT 
877 3 4 GGGGTGGTGGCCGTGGTGGCCCCCGAGGCCCCCGCGCTGCTCGGCCTGGTCAGCGTGATCGCCCCGGCGATCGTC ACCGGC AAC ACGGTGGT 

>GVVAVVAFEAPALLGLVSVIAPAIVTGNTVV 
8782 6 GGTGGCGGCCTCGCCGACCCAGCCCCTGGCCTCGGTGACCCTGGCCGAGGTGCTGGCCACCTCCGACCTGCCCGGCGGGGTGGTC AACGTCC 

>VAASPTQPLASVTLAEVLATSDLPGGVVNV 
87 918 TGACCGGTGCGATC ACCGAGACGGTGCCG ACGCTCGCGGCGCACCTGGACGTCAACGCGATCG ACCTGACCGGGGTGGGCG ACGCGTCGCTC 

>L TGAITETVPTLAAHLDVNAIDLTGVGDASL 
88010 GCCACCGAGCTGG AGGTCAGGGCGGCGGAGAACCTCAAGCGGGTGATTCGGCCGGCCCCGGCCGACCACGACTGGTACGCCG ACCCGGGCCT 

>ATELEVRAAENLKRV I RPAPADHDWYADPG L 
8 8102 CACCCGGATGACGACGCTGCTGGAGACGAAGACGGTCTGGCACCCCAAGGGCGTCTGAGCCCACCGACCCGTCCACCGACCCGCCCGCC AC 

>TRMTTLLETKTVWHPKGV 
8 819 3 CCGGCCGCCGGAGGCAGGGGTGGGCGGCGGGTGGGGTGGATCTACTACGAGGGGTAGGATTGCCGCGTGACTCGGTTGGGTGATCTTGAGC 
8 8284 GGGCGGTGATGGACGTGCTGTGGGACACCGTCCCGGGCACGTCGGACGGGGTGACGGTGCGCGAGGTCGCCGAGGCCCTCGACGGCCGCGA 
>MDVLWDTVPGTSDGVTVREVAEALDGR E 
8837 5 GCTGGCGTACACGACGGTGATGACCGTGCTGGACCGGCTCGCCGGCAAGGGCATGGTGCGGCGCCAGCGGGAGGGGCGGGCCTGGCGCTACC 

> LAYTTVMTVLDRLAGKGMVRRQREGRAWRY 

8 84 67 AGGCCGCGGCCAGCCGCGAGGCGCACATCGCCCAGCTCATGCTCGACGCGCTGGACCTCGGCGGCAGCCGGGACGCCGCGCTGGTGCGCTTC 

>Q AAASREAH IAQLMLDALDLGGSRDAALVRF 
88559 GCCCGGTCGGTGACCGGCACCGAGGCCGAGGTGCTGCGCGCCGCCCTCGGCGCCGAGGCGGGCGGCCCGCTGACCGACCGCGTCGACGCGCC 

>ARSVTGTEAEVLRAALGAEAGGPLTDRVDAP 
8 8 6 51 GCGCGCCGACCGGGCCGGGCAGCCGGCCCTGCCCGACGAGGCGACGGACCGGTAGGGCCGCCGTC ATGGCGTACGCCGTGCACTTCGCCG 

>RADRAGQPALADEATDR • >MAYAVHFA 
88741 CGACGGTCCTGGCCTGCTACCTGACCGCTCAGGTCCTGGCGGCGTCCACCTGGACGTGGCGGGCCCCCCGGATCGCGATCGTCTGCTGGCAG 

>A T V L A C Y L T A Q V L A A S T W T W R A P R I A IVCWQ 
8883 3 GCGGTCGGGCTCGCGCTCGGGCTCTCCGCGATGGGCCTGCCGATGGCGCTCGGCGTGGCCGCGTACGACCGGCCGACCGGCAGCGCGTTGCT 

>AVGLALGLSAMGLPMALGVAAYDRPTGSALL 
8892 5 CGCCCTGGCC ACCGACCTGACCC ACGGCACCCTGCCGGCCGGGCTCGGCGCGGTCC ACCTCGGTCTGGTCGGGGTCGGGTTCGGCATCGGGG 

>ALATDLTHGTLPAGLGAVHLGLVGVGFGIG 
89017 CGGCGCTGCTCGCCACGACGGTACGCAGCGTGCAGGCG ACCGTCCGGGCCC AGCGGC AGCACCGGGACCTGCTCGCCCTGGTGGCCCGGCGG 

>A ALLATTVRSVQATVRAQRQHRDLLALVARR 
89109 GACCCGGAGGTGCCGGGGGCGCTGGTGCTGGACCATCCG AGCGCGGCGGCGTACTGCCTGCCGGGCGTGCGGCCCCGGGTGGTGGTC AGCGC 

>DPEVPGALVLDHPSAAAYCLPGVRPRVVVSA 
89201 CGGGGCGCTCAGCATGCTCGACCGGGCCGAGCTGGCGGCGGTGCTGACCCACGAGCGGGCGCACGCCCAGGAGCGCCACGACCTTGTGCTGC 

>GALSMLDRAELAAVLTHERAHAQERHDLVL 

892 9 3 TGCCGTTCACCGCGCTGTGCCGTGCGCTGCC^TGGTTCCGTIXSGGTACGCGACGCGCACGAGCGGGTCGCCCTGCTGGTCGAGATGCGCGCC 

>L PFTALCRALPWFRWVRDAHERVALLVEMRA 

893 8 5 GACGACAAGGCCCGGGAGCTGCACGCCGAGGCTCCCCTCGCGGGGGCGTTGCGCCGGTTCGCCGCGGCCGGCC ACCGGATCGCGCCGGCCGG 

>DDKARELHAEAPLAGALRR FAAAGHR IAPAG 
8947 7 CACCCTCG^CCTGGGCGACCGGGACCTCGACGTCCGGGTCCAGCGGCTGCTGGTCGCCGACCGGCCGCCCCGGCTGATCGGGGCCGCCGCGC 

>TLGLGDRDLDVRVQRLLVADRPPRLIGAAA 
89569 TGGCGGTGGCGGTCACCCTGXjTCGCGCTGCCGGTCTCCCTCTTCCTG 

>L AVAVTLVALPVSLFLS • 
89660 CGGACACGTCCGACCCGGACGCCTCGCCCGAGTTGGGCCCGTGTCCCACXX^GCCGGCT^ 

89752 GCGATAGGTAGAGAGCCTACGTGTAGTCTTCCTACGACAAGGGAGCCTACTACCGGAGXXSCGX^CCATGGATGAACT^ 

>MDQLLLAR 

89842 TCCAGTTCGCCACGACCACC*TCGCTGCACTTC 

>LQFATTTSLHFLFVVVTLGLVTLLVGLQTAW 
89934 ACGATCACCGGCAATCCCGTCCACGAGCGGCTGACCCGGTTCTG<^^ 

>TITGNPVHERLTRFWGQLYVINYVLGIATGL 
9002 6 GCTCATGX3AGTTCCAGTTCG<XX^T^3AACTG<5AGCGGCCTGTCGCGCTACGT^ 

> LMEFQFGLNWSGLSRYVGNVFGAPLAI ETL 
9 0118 TCGCGTTCTTCCTGGAGTCCACGTTCCTCGGGATGTGGA 

>V AFFLESTFLGMWI FGWHRLRRGVHLALLWG 
9 0210 GTGGCGCTGACCGCGTACGCCTCGGCGTTCTGGGTC ATGGTGGCG AACGCCTGGCTGCAG AACCCGGTCGGCT ACG AGGTGCGCG ACGGGGT 

>VALTAYASAFWVMVANAWLQNPVGYEVRDGV 
90302 GGCCCACCTGACCGACTTCGGCGCGTTGCTGACC AATCCC ACCTTCG^CCTGGCCTTCGGGCACGTGGTCGCCGCCGCCCTGCTCACCGGCG 

>AHLTDFGALLTNPTFGLAFGHVVAAALLTG 
9 03 94 GGATGCTGATGGCGGCGGTGAGCGCCTGGCACCTGATCCGGCGGACCCCGGACCACGCGCTGTTCCGC^ 

>GMLKAAVSAWHLIRRTPDHALFRTSLRIGLV 
90486 ACCGCGGCGGTCTCGATCAGCCTGGTGCAG03CT^ 

>TAAVS I SLVQGFGFAQFGPVGQTQ PTKFGGG 
90578 CGrGCAGCXJCGACGCCCTGGTCGCCGAATGGACCTCCCGGTTCGGGCCCGGCG ACT AC ACCCCGCCCGTGCTGGCCG ACGTCGGGCTCGGTT 

>AQRDALVAEWTSRFGPGDYTPPVLADVGLG 
90670 TCATG ATCC TG ATCGGC CTCCTCC TGGG CTGTCTGTG<5CTGCTGCTCCCCCTCCTCTGGCGGG ACTGGTTC ATC C GGCTG CGC TTC C CGCTC 

>F MILIGLLLGCLWLLLPLLWRDWFI RLRF PL 
90762 TG^CTGATCCTGCTGGCGCTGCCGCTGCCCTTCGTCG^ 

>WL I LLALPL-PFVAVI LGWIAREVGRQPWVA V 
9 0854 CGGGCTGCTTTCCACCGAGCGGGCGGTCTCGCCG^TCGCGCCCGKX^^ 

>GLLSTERAVSPVAPGVMLASLIGFTLLLGG 
9 094 6 TCGCCGTCGCCAACTG<MTGCTGTTCGCCCGGTACGCCG<ZCCG<KX1AGCCG^CGATCCC 
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>L AVANWVLFARYAARGAADPALGRR P G P A A r 
9103 8 GAGTCCCGTCCCGTCCCCGTCCTCGGCTGAGGAGGCCCCTGTGGAACTCGCCTGGTACGCCCTGCTCGGGCTCTTCCTCGCCGGCTACC 

>ESRPVPVLG* >VELAWYALLGLFLAGi 
91127 TGGTCCTCGGCGGCTACGACTACGGCGTCGGCCTGCTGCTCGCCCGGGGCGGCCCGCCGGCCCGCCGCCGGGCGGCCCTCACCGCCGTGGG7 

>LVLGGYDYGVGLLLARGGPPARRRAALTA\o 
91219 CCGTTCTTCCTCGGCAACGAGGTCTGGCTGGTGGCGACCGTCGGCATTCTGTTCGGCGCGTTCCCCACCCTGGAGGGGG 

>PF F LGNEVWLVATVG ILFGAFPTLEGELLS 3 
91311 CTTCTACCCCGTCGTCGCCGCCGCGCTGGCCGGGGTGATCATGGTGACCGTCGGCGTGCAACTGCGCAGCCGCCCGACCGACGAGCCGACCC 

>FYPVVAAALAGVIMVTVGVQLRSRPTDEPT 
914 03 GCGCCGCCTGGGACCGGATGGTGGCCGCCGGGAGCCTGCTCGCCGCGTTCGGCTGGGGGGCGCTGCTCGCCGGGCTGC 

>RAAWDRMVAAGS LLAA FGWGALLAG L. L C G V F 
914 9 5 CTGGCCGCCGACGGGCACGTCACGGGCGTCGGCCACGTGGCCACCCCGTTCGCGGCCCTCGCCGGGCTGGCGATGACGGCCCTGG 

>LAADGHVTGVGHVATPFAALAGLAMTALVAY 
91587 GCACGGTGCGACGTTCCTCACGCTCCGGCTGTCGGCCGCCGACGCCGCACCGCTGGCCCGTACCGCCCGCCGGCTGGTCGCG^-rGGCGCTCG 

>HGATFLTLRLSAADAAPLARTARRLVAY A L 
91679 CCGCCGTCGCCCTGGCCGCCGTCGCGGGCGCGCTCTCCGATCGGGTACGCGCCGCGACGCAGCGCCCGCTGCGGGCCGTACTGJTGCCGTTG 

>AAVALAAVAGALSDFVRAATQRPLPAVLLFL 

917 71 GTACTGGTGGCGGCGCTGCTGGTGGCCCGGGCGGCGCACGCGCGGCACCTGCCCGGGGTGGCCTTCGCCGCCACT7CGGCGGCGCTGGCGCT 

>VLVAALLVARAAHARHLPGVAFAAT£AALAL 

918 6 3 GCCGGTGGCGGGAGTCGGCGCGGCGTTGTGGCCCTAGGCGCTGGTCTCCACCGTCGCACCGACGGCATCACTGAGCGTGACCGAC JCGGCG3 

>PVAGVGAALWPYALVSTVAPTASLSVTDAA 

919 55 CCAGCGGGCCGACGCTGACGGTGCTGGGCTGGCTGGCGCTACCGCTCCTGCCGGCCCTACTAGGCTTCCAGGCG ATGTGCTGGTGGGTGTTC 

>A SGPTLTVLGWLALPLLPALLGFQAMCWWVF 
9 2047 CGGGGACGAACCGACGGCAGGGCACCGGTGTACTGGTGAGCCGCCGTCCCTTCGACCCACGTCTGCTCCGCCGGGTCCC CGCGGCCCGGCG 

>RGRTDGRAPVYW • 
9213 8 CGACCTCGCCGTGCTCGCGGTGCTCGGCGGGCTGACGGCGCTGCTGGTCGTGGGGCAGGCCACCGCGCTGGCCACG GT3CTGGCCGCCGCG 

>VLAVLGG LTALLVVGQATALATYLAAA 
92229 CTCGACGGGCGGTTGGCCCGGCCGGCGCTCGCCGGTT^ 

>LDGRLARPALAGFLAAVVGRALVAWAQGTYA 
9 2321 GGCGCGGGCCGCCGCGACGGTCAAGGCGGCGCTGCGG*3CC GACCTGCTCGCCGCCGTCGGCCGGCACGGTCCCGGGTGGGTGGCCGGGCAGC 

>ARAAATVKAALRADLLAAVGRHGPGWYAGQ 
9 2 413 GGGCCGGGCAGCTCGCCACCCTGGCCGGGCGGGGGCTXSGACGrCCTGGACGCCTACTTCACCGGGTACCTTCCGCAGCTCGTGCTCAGCGTC 

>RAGQLATLAGRGLCA«CAY FTGY L F C LYLSV 
92 505 ACCGTCCCGGTGGCCGTGCTGGCCCGGATCACCTTCGGCGAGTGGGGCTCGGCCGTCATCGTCGCGCTGACCCTGCCGCTGATJCCGGTCT" 

>TVPVAVLARI T F A r W G S A V I YALTLFL I PVF 
92 597 CGGGGCGCTGCTCGGCTGGCAGGCGCAGGCCGCCACCGAGCGGCA3TGGCGGCGG CTGTCGACGCTCGGCGGGCACTTCCTCGACATGGTCG 

>GALLGWQAQAATERQWRRLSTLGGHFLDMV 
92 689 CCGGCCTGCCCACGCTGCGCGCGTTCGGCCGGGCCCGGGGCCAGGTCGAGGTGGTCCGGCGGATGGCCGACGGGCACCGCGCCGCGACGATG 

>A GLPTLRAFGRARGQVEVVRRMADGHRAATM 
92781 CGCACGCTGCGGATCGCGTTCCTGTCCGCGCTGGTGCTGGAGCTGGTCGCCACCCTGTCGGTGGCGCTGGTCGCGGTGCCGGTGGGCATCCG 

>RTLRIAFLSALVLELVATLSVALVAVPVGIR 
9287 3 GCTGCTCGGCG<5CGGGCTGGCGCTGTCCACCGC^ 

>LLGGGLALSTALLVLLLTPEAYLPLRAAGS 
92965 GGTTCCACGCCAGCATGGAGGGGCTGGCCGCGCTGGACGAGGCACTGACCCTCTCCGCCGCCGACCCGACCGCCACGGCCACCGCCGGGTCG 

>R FHASMEGLAALDEALTLFAADPTATATAGS 
9 3 057 CGGCCCGTCCCCGACGGGCGCGCCGAGATCCGGTTCGAGGGCGTGACCGTCGCGTACGAGCGGACCGTGGCGCTACGGGACGTCACGCTGAC 

>RPVPDGRAEIF FEGVTVAY ERTVALRDVTLT 
9 314 9 AATCCGGCCCGGCGAGCGGATCGCGATCG'i CGGGCCGAGCGGCGCGGGCAAG AGCACCCTGCTCAACCTGCTGCTCGGCTTCGTCGCCCCG A 

>IRPGERIA1VGPSGAGKSTLLNLLLGFVAP 
9 3241 CGCAGGGCCGGGTCACCGTGGGTGGCGTCGACCTGGCCGGCGCGGACCCGGACGGCTGGCGGCGTCAGGTCGCCTGGGTGCCGCAACGGGCC 

>T Q G R V T V G G V D L A G A D P D G W R R Q V A W V P Q R A 
9 3333 CACCTCTTCGCCGCCTCGCTGACCGACAACATCCGGCTCGGTGCCCCCGGCACGCCCGACGCCGCGCTCGCCGGCGCGGTCGCCGCCGCCGC 

>HLFAASLTDN I RLGA PGTP DAALAGAVAAAA 
93425 GCTGGACGAGGTGGTCGCCGCCCTGCCCGACGGGCTCGACACCG 

>LDEVVAALPDGLDTVLGERGHGLSSGQRQR 
9 3517 TCGCCCTGGCCCGGGCGTTCCTGCGGGACGCGCCGGTGGTGC^ 

>V ALARAFLRDAPVVLLDEPTARLDTASEAGV 
93609 CTGGCCGCCACCCGCCGGCTCGTCGCCGGGCGAACCGCCCTGTTG^ 

>LAATRRLVAGRTALLVAHR PALLSDADR I L R 
9 37 01 GGTCGAGGAAG<3CCGGGTCACCGAGCTGACCACCACCCCG^CCACAGaX^T 

>VEEGRVTELTTTPATGVTPGPGEAAAGPAG 
93793 AGGTCGCCCCCGCCCCG<3CCGGAGAGGGGGCGGCCCGATGAGCACCG<rK:CCGCCGACGACGCCTTC 

>QVAPAPAGEGAAR • 

>MSTGPADDAFA I PLPADG 
9 3884 GCCCCGGTGGCCGGCGGCAGCGTCCGGGCCGCCGAGCG<X;CCGTGCTCC^ 

>APVAGG SVRAAERAV LR LARPYLGRLVGAG L 
9 3976 GCTCGCCGCCGCCACCGAGTTCGCCGGGCTGGCCCTGATGGCCACCGCC^ 

>LAAATEFAGLALMATATWLLMSAAGRPPLD 
94 0 68 GGCTCACCGTGGCGATCGTCGCGGTCCGGGCGCTGGCGATCAGCCG AGGCGTGTTCCGCTACACCGAGCGCCTCGCCGGCCACGATGCCGTG 

>R LTVAIVAVRALAI SRGVFRYTERLAGHDAV 
94160 CTGCGGATGATCACCGACGTCCGGGCCG<SG<^CTTCGCCGCCCTGGCCGCCCGGCGCGACGCCGCCCGGCAGCGCACCGGGGACGCGCTGAG 

>LRM I TDVRAGV FAALAARRDAARQRTGDALS 

942 52 CCGGCTCGTGTCCGACGTG^AGCCCGTGCAGGACCTGCTGCTGCGG^ 

>RLVSDVEAVQDLLLRVLVPGAAATVVSVLA 

943 44 TGGCCGGX^CCACCACCATCTCGCTCCCCGCCGCCGGG^TGC^^ 

>VAGATTISLPAAGVLALGLLVAGVALPLAAT 

944 36 GCGCTGACCCGGCACGCCGCCGACCGGGTGGCCCCGCTGCGCGGCGCGCTCGCC ACGGACGCCGTGG ACCTTGTCC ACGGCGCCGCCG ACCT 

>ALTRHAADRVAPLRGALATDAVDLVHGAADL 
94 528 GGCCGCGTTCGGTGCCACCGGGTACGCGCTGG ACGCCGCCGCCGATCGGGCCCGCCGGCTGGCCCGGCTGG AACG ACGGCTCGCCGCC ACCG 

>AAFGATGYALDAAADRARRLARLERRLAAT 
94 620 GCTTCGCCGTGGACGCCGCCGGGGCGCTCGTCGCCGGGGTGACCGCCGGCAC 

>G FAVDAAGALVAGVTAGTVVVTALRDGVGGV 
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94712 CTGGTCGGGGTGCTGGCGGTCGGTTCC^ 

>LVGVLAVGSLAAVEVALALVGAARQRTC^FA 

94804 CGGGCTGGTCCGGGTGGCCGCCCTGCTGACCGCCCCGCAGGCCGACGCGCCCGCCGCCACCCCGCCCGGTGCCGCCCGTGCCGCC3CCGTCG 

>GLVRVAALLTAPQADAPAATPPGAARAA^AY^ 
9489 6 GTGCCGGCCCGCACGACGTGCGCTTCGACGCGGTCACCGTGCGGTACCGGGCCGGCACCGCCCCGGCCCTGGACCGGGTCA^^ . . . . . 

>GAGPHDVRFDAVTVRYRAGTAPALDRVT^D^ 
949 88 CCGGCCGGCCGCCGGGTCGCCGTGGTCGGGCCGAGCGGCGCCGGCA^ 

>PAGRRVAVVG P SGAG KSTLAAVLTGTV R P E ^ 
95080 GGGCCGGGTCACCCTCGACGGGGCCGACCTGTCGGCGTACCCGGTCGAGGAACTGCCCCGGGCCGTCGGCGGCCTGCTCGCCGA^ 

>GRVTLDGADLSAYPVEELPRAVGGLL « Y^ 
9517 2 TCTTCCACGCCACGGTCCGGGAGAACCTGCTGCTCGGCCGGCCCGCCGCCGACGAGGCGGAGCTGACCGCCGCGACC. GG^^^ „^ . o 

>VFHATVRENLLLGRPAADEAELTAATRAAo^ 
95264 CTGGACTGGGTGCACGCCCAGCCGGCCGGGTGGGACACCGTGGTCGGCGAGGAGGGCGGACAGCTCTCCGGCGGCCAGCGGCAGCo^ ^T^'o^ 

>LDWVHAQPAGWDTVVGEEGGQLSGGQRQRL_ rt 
9 53 56 GCTGGCCCGGGCGCTGCTCGCCGCGCCCGGGGTGCTGGTGCTCGACG AGCCGACCGAGGGGCTCGACCCGTCCGCCGCC^ACoCG^TG ^ TV ^ 

>LARALLAAPGVLVLDEPTEGLDPSAADAV^L 

95448 CCTCGGCGCTTCCGGCGACCCCCGCCGGG^ 

> A SALAATPAGHSVLLISHRLSGLADLDEIY\ 
9SS40 CTCGACGCCGGCCGGGTGGTCCAGCGTGGCCGGCACGACGAGTTGGTCGCCGCGCCGGGCTGGTACCGGGACCAGTGGCTGCTCCA^oAGo^ 

>LDAGRVVQRGRHDELVAAPGWYRDQWLLQEA 
95632 GGCCGAGCGCGGGTACCTGGCCCTGACGCCCCGCCCCTGAGCCGGCTCCGGGATTCCCCCGGACGCGTCGGCAGTCACCGCATGGCAGGCT 

>AERGYLALTPRP 
95723 CGTCGCATGGTGCGCTGCGACGACGTACTCGTGAAGGAGCGGCTGCGCGAGTTGAGCGACCGGCTGCACGGCCCGGCACGGCTCAAGGCCG 

>MVRCDDVLVK ERLRELSDRLHG PARLKA 
9 5 814 ACCTGCTGGCCGAGGCCCGCC ACGCGTTGC AGG ACGCCGTCG AGGCGTACCGGGACGGCGGGCTGCCGGCGGCGG AGGCCG AGCGGCGGGC A 

>DLLAEARHALQDAVEAYRDGGLPAAEAERRA 
9 5906 GTGGCCGAGTTCGGCGAGCCGGCCCGGCTCGCCCCGGCGTACCAGGCGGAGCTGGCGGCCGGGTCGCTGCGCGGCCTGTCCCTGCGGGTGCT 
>VAEFGEPARLAPAYQAELAAGSLRGLSLRVL 
959 9 8 CGCGGTCGCCGGCGTCCTGGTCGTCGCGGGCGATCTGACCTGGC AGGGGTCG AGCTGGAGCGGCGGCCCCGGCCCGCCGGCGGC CTACCGCC 

>AVAGVLVVAGDLTWQGSSWSGGPGPPAAYR 
96090 TGCTGTCCGCCTCGGTGGACGGCATCTGGCTGGGCGCGGTCGTGCTGTCGGTtiGCGGGGTTGCTGCTGGTCGCCGCCTCGGCCCGGTGGGCG 

>LLSASVDGIWLGAVVLSVAGLLLVAASARWA 
96182 CACCCGGCCCTGCCCCGGCTCGCCCGGCTGACCGGTCTCGGGCTCACCGCCACGCTCGTCCTGGGCGTGGCGACCGGCGCGGCCCTGTACGC 
>HPALPRLARLTGLGLTATLVLGVATGAAL Y A 
96274 CTGGTCGATCGGGCTCTGGGAGGCGGCCCGCACCTGGrCGCCGATGCTCGTGGGCGCGCTGCTCTGCGGCGCGGGGTTCTTCTGGri * CGG . 
>WSIGLWEAARTWPPMLVGALVCGAGFFt%IG 

junction 



96366 GG<5CGGCCCGGTCCTGGCTGCTCTCGGCACGCCGACCGGCCGGACCGGCGTAGTCGGGTGGGCGGCGGGCGCGGTCAGGCCGGCGTGGCGG 
>R AARSWLLSARRPAGPA * < • A P T A P 

9 6457 GGGTGTCGCCGAGGAACTGGCCGACGGTCGCGCTGAACTCCCGCCAGCCGGCCCGTTCCCCGGCGAGGGCCCGGCGGCCGGAGTCGGTCAGC 
<TDGLFQGVTASFERWGAREGALARRGSDT L 

96549 TCGTAGGTGCGGCGCTCGCGKSCCGTTGACGGTGCTCCAACTGCTGACCACGTGCC^ 

<E YTRRERGNVTSWSSVVHGARELRRLAPY IT 
96641 CCCGGTAGGCAGATCGAGGCTGCCCTXTGCTGCGCGCGCGCAGCGCCTCGATGATGGCGTAGCCGTGCAGCGCGCCCCGTTCCAGCACCGC 

<GTPLDLSGESRARLAEI IAYGHLAGRELVAL 
96733 GCAGCAGCGCGTCGAGGTGTCCGTGCAGCGCCTCGGCCTTCATAGGTA 

9 6 8 2 4 < GCACCGGGCCTCCTAAGCCG^ 

>VG S P KH*T 

9 6914 GGAGGTCAGCGTGGCCCGCCAGTCGCCCCAACGGCCCGACGCCGACGAGCCTG AGCTCG ACG AG ACCGACGGC ACCGCCGCAGAGGTCG AAG 
>EVSVARQSPQRPDADEPELDETDGTAAEVE 

97006 AG<SACG^CGCGCGCCCGTCGGCGCAGGACGCCGACCGCGCGCTCTGGGACGAGCTGCGCATCGACCCGGTCGAGATCGCCCTGCCCGCCGGC 
>EDGARPSAQDADRALWDELRIDPVEIALPAG 

97098 ACCGGCT ACACGCTGC GGGCGT AC C GGCCGGC ACGGGAGTTG AC C C CG ACC G ACGTCG CCGAGCG CG AC C AGG A CG AC C C GTTCCTGGCCCG 
>TGYTLRAYRPARELTPTDVAERDQDDPFLAR 

9719 0 CCGGCAGGCGGTCGAGACCGACGAGGACG AGG ACGAGGTC ATCATCCTCG ACGAGGAGGTGGCCGCCG AGTTCGCCG AGGCGGACGCGG AGG 
>RQAVETDEDEDEVI ILDEEVAAEFAEADAE 

97282 AGGCCG<3CGG<5AAGTCCCGCTCCCGCAAGCCCCGCGCCGACGCXX>ACTCCGACGACGCCGGAGCCGCCACAGACGCGGACGCGGAGGAGGAG 
>EAGGKSRSRKPRADADSDDAGAATDADAEEE 

9737 4 CCGGACTCCGACGAGGACGAGGCGGGCGACGAGGAGGTTCCGGTCT^ 

>PDSDEDEAGDEEVPVFLSHRGRLLLFKTPES 
97466 CCTCGTC AGCTTCGTCCGGTCCGGCGCACCC AACG ACATGTCTC AACTCG AC AGCTGGAATC AACTGTCCG AACG<X5TGGAGCCGGCCG AC A 

>LVSFVRSGAPNDMSQLDSWNELSERVEPAD 
97558 TCGTCCCGCTCGACG AGGACACCTACG AGCTCGACCTGGTCGTGGAG AACCTCCGG<X;TGGGC ACC^CACCTGGG ACTCGGCGCTCCTG ATC 

>IVPLDEDTYELDLVVENLRGGHDTWDSALL I 

97 6 50 GAGCCGXSCGAGGTGGCCCGGGACGTCGCGTATCCCCTGCGT^ 

>EPARWPGTSRMPCVCPPCWTCSPPAPASTTW 

97742 GACGAGGCGCTGCGCGCCACGGCCAACGGCGGGCTCGG<^ 

>TRRCAPRPTAGSGASSAAGG • 
97 83 3 TCGGTTGGCGCACCATTGTCGGCAAGATCTCTGCGGTCGTGGACTGGCC^ 

BamHI 
junction marker 

97 92 5 ACCAGTCCCGGGAG<1AGGACGACGCTGTGGCGCTCGTX5CGCGTGTACTGCGGTCT^ 

98017 GGTTCGGCGCTG ACGTCCGCTGTGGTCG ACG ACGC AGGTCC^CTCCTCCATGTCTGCGAGATCGGCG ACG ACCC AGCCGGCT ACGCTC AC^T 
9 810 9 GGTCGTGCTACTCGTGGAGCGGTCGGGCGGGCCGAGCGGGGCTGCGATCGCCGCCG ACAGCGACG ACCACACGGTCACCTCGCTGCTGAGTG 
98201 CCGCCGGGCGTCCACTGGCGATCGCGGACGACGACTCXX3TGX3ACGACTTCGCCGAGCGGTTCGCCGACGACGACTCG 

> M Q 

98292 GTCCGCGCCGGCCGAGCGGCGGGCGGTC^^ 

>SAPAERRAVGLARALQAGALSAVTLPA PRO 

983 84 TCGCCGGCTACAAGCAGGTCCTCTCCtGCGCACGCCGC<jCTCGCCAGCGGCC 

>LAGYKQVLSAHAALASGRHSAAVALREVLRE 
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9 847 6 CTCTACCCGGCCGCCCTGCGCGCGTACCCGGACCCGGCCGAGCCGGTCGCCCTGGCCGTGTTGGACGCCCTGCCCG AGCCCGGC ATGCTGCG 
>LYPAALRAYPDPAEPVALAVLDALP E P G M L J 
9 856 8 CGGG ACGATCGCCCGGGGCCGGG AGGTGTCGGTGGCGGCGG ACGCC ATCGCCGCCCACCTCGCGGCCGACGGGGTGGCCG ACG AAGGC AA3 A 

>GTI ARGREVSVAADAIAAHLAADGVADEGK 
9 8660 TCAACGATGCGGTGACCGCGCTGCGGGTCGCCATCGCCGAGACCCCCCGCCGCGCGGCGGTCAGCCGGGCGCTCACCTCCGCCGTGGCGGAG 

>INDAVTALRVAIAETPRRAAVSRALTSAVAE 
9 8752 ACGGTCCGTCAGGCGGTGGCCTCGGTGCGAGCATGCGACGCGGGCTGCGAGGCCCTGGTCGGCGCGCTCGACGCCCGGGTCACCACCCCCAC 
>TVRQAVASVRACDAGC EALVGALDARVTT PT 
98844 CCCGGTGCCGGGCCGGCGGGCCGCCGCCCGCCGGGGCGAGCCGGTCGCCGAGTlK^CCGGCGCCGGCCTGCGCGCACTGCGGCCCACAGAG'r 

>PVPGRRAAARRGEPVAELPGAGLRALRPTE 
989 3 6 CCGAGCCGGTCCCCGGCCGGCGCAGCCGTCCCG AGCCGGTCCCCGGCGGCAGCCTGCCCGCCCAGCCCCGCCCGCTCGGCCCGCCGCCGGTC 

>PEPVPGRRSRPEPVPGGSLPAQPRPLGPFPV 
99028 GCGCCGGAGCCCGTCGCCCCGCCGCCGGTCGCCCCGCGCCCGATCACCCCGGCCGCCTCGGCCACCCCGCCGGTCTCCGGCC CGCCGTCGCC 
>APEPVAPPPVAPRPITPAASATPPVSGPPSP 
9 912 0 CGAGCCGCGCCGCCTGATCGACAACCCGGCCAACCGGCCGGTCTCCGCGCCGCCGCCCCCGCCACCGGGGATCACCCCGATCGCGCCGAGCC 

>EPRRLIDNPANRPVSAPPPPPPGITPIAFS 
9 9212 AGCGCGAGCGCGGGTCCGTGCCGCCGGCCGAGGCCGGTGAGCCGTTCCGGCCCACGCTGACCACCGCCGCGATCCAGAACGCGCGGGCGGAG 

>QRERGSVP PA EAG EPFRPTLTTAA I Q N A R A E 
9 9 3 04 CGGCAGCGCACCATCATCCCGCCTCGCCCCAAGACGACGGGCGAGTCCGCGCCGCCGCCCACCGGCGGCTTCAGCGCCACCGACCTGAGCGT 
> R Q R T I I PPRPKTTGESAPPPTGGFSATDLSV 
9 9396 CCCGGTGCCGACCCCGCGTCCCGGCCAGGAGTCCGCTCCCCCCGGCTCGCGGGCGAACTGGCCGCTGGTCAACAACCCCGAGGACCCCGCCG 

>PVPTPRPGQESAPPGSRANWPLVNNPEDPA 
9 9488 ACAGCTCCCCGAACAATCCCGTCGCGCGGCGGCCCTTGGAGGATCGGGCGAAGCGGCAGATCGACGCGCCGACCCAGGTGGTCCCGCCGGCC 

>D SSPNNPVARRPLEDRAKRQIDAPTQV VPPA 
9 9580 G AGGGCCGGGTCACCCCGCCCTGGCTCGCCGACG ACCTGCCCCAGG AGCCACCGATGCTGCGGCTGGTCG AGCCGCC ACCGCTGGCCG ACCG 
>EGRVTPPWLADDLPQEPPMLRLVEPPPLADR 
9 9 672 GGCACTGCGGGATGGGCCGGGCCAGGCTGCCGACCCGCGCCTGGAGCCCCCGCCGCTGCGGCTGGTCGACCGCGGCGAGGCAGCCCGCGCCG 

>ALRDG PGQAADPRLEPPPLRLVDRGEAARA 
9 97 64 GCCGTCCCGCGCCGGAGCCCCGCCCCGAGCGGG"CCCCGCGGAACACCGGTCCCCGGTGGGTCAGCGGGTCCCGTTGGAGGAGCGGCCCGAC 

>GRPAPEPRPERAFAEHRSPLGQRVPLEERPD 
9 98 5 6 ATGGAACATCGGAGCGCCCCGCCGCAGCGGTCGCGGTCCGCGCCGATGGAGCGGCGTACCGCGCCGATCTCCGACGAGGGGGACGGCGACCT 
>MEHRTAPPQPSRSAPMERRTPP I SDEGDGDL 
9 99 4 8 GCTGATCTTCGCCGCCGCCAAGTCGGCCTGGTTCGTCGGGCACGGCGACGAGTCCGAGATGGACTGGTCGAGCACCGCCG ACACCGGGTGGC 
>LIFAAAKSAWFVGHGDESEMDWSSTADTGW 
10004 0 AGGCCGCCGAGCAGGCCGCCCGCCCGGCGGTGGGCGCCGATACCAAGGCCGGGTTGCCCAAGCGGGTGCCGCAGGCCAACCTGGTTCCGGGC 

>Q AAEQAARPAVGADTKAGLPKRVPQANLVPG 
100132 TCCCCCCTGCGCGAGGAGCGTCCCCTACGGATAGTCCGCGACGCGGCCAGCCTCGCCGAGAACACGACCGGCTACTTCCGGGGCTGGCGTCG 
>SPLREERPLRIVRDAASLAENTTGYFRGWRR 
100224 CGGGCAGGAGATCGGCGGGTTCGCCGTCGGCGGCCGGCCGGGCCGCGAGGCGGCCGGCGGCTGGGACTTCACCCGGGACACCGGCGACCGAG 

>GQEIGGFAVGGRPGREAAGGWDFTRDTGDR 
100316 ACGACGACCGGGAGTACGAGTACCGGTCCGCCGGCTACCGCTCCTGACCATCGCGCCGGGGTGGCGGGCACCGCCGCCCCGGCCGCGCCCG 

>D DDREYEYRSAGYRS 
1 004 07 CCGTGCTGGCGTACGGCCGTAGCAACCCGACGCCCACCCGACCGACCGGTGGGACCACACCGAAGAGCAACCCTGGCCGGCCGCCTGACGTC 
100499 CCGCAGGGACG^TGACGGCTACTGGCCGTCCCCCCGGGAAGGTTGCGAGGCGGTCGGGGCGCACAGGCGCTGTCAGGGCCGCTCCTGAGCCG 
100591 CCCTACGGAATGGGCTAGCCCTACGGAATCGAGCGCCGGCCGCCCGCCGGTGGCGTCGAGTTCGCCTCCAGCCGGTCCCCAAACAGCCGGAC 
100683 GCGCCCGACGGCCCGGCGGGTGCCGGTCGTCGGGCGCGTCCCGTCGTCGATCAGGCCGGCGCGACCGCGGGCGAGCGGCGC ATGGTGAGCA 

<• APAVARSRRMTLV 
1 0 077 4 CGTACTCGACCAGCGAGATCAGCACGTGCTTCGTCGACTCCCGGTTCCGGGCGTCGCAGGCCACCACCGGCACGTCGTGCGAGATCGCGAGC 

< YEVLS I LVHKTS ERNRADCAVVPVDHS IAL 
100866 GCGTCCCGGACGTCCTGCGGGTCGTGGTACTGCATCCCGTCGAAGCAGTTGATGGCCACCAGGTACGGCAGCCGCCGATGCTCGAAGAAGTC 

<A DRVDQPDHYQMGDFCKIAVLYPLRRHEFFD 
100958 GATGGCCGCGAAGCAGTCGGCCAGCCGCCGGGTGTCGACCAGGACCACCGCGCCGATGGCGCCCCGGACCAGCTCGTCCCACATGAACCAGA 

<I AAFCDALRRTDV LVVAG I AGRVLEDWMFWF 
101050 ACCGGGTCTGGCCGGGGGTGCCGAACAGGTACAGGATCAGGTCCCGGTCGATCGAGATACGGCCGAAGTCCATCGCCACCGTGGTCGTCGTC 

<RTQGPTGFLYL I LDRD I S IRGFDMAVTTTT 
101142 TCGCCCGGX3ACCTGCCGGGTGTCGTCGACGCCCACGCCGGCGGAGGTCATGATCGCCTCGGTGGTCAGCGGCGTGATCT 

<E GPVQRTDD.VGVGASTMI AETTLPTIESVSG 
101234 GACC AGCGTCGTCTTGCCGACGCCGAACCCACCGGCGATAACGATCTTCGCCG ACGTCACGCGCCCGCTCGGG AC AGGCGGGCGGTGCG AC A 

<VLTTKGVGFGGAIVIKASTVRGSPVPPRHSM 
101326 TGTC AG AGCCTGCGAAGTCCACTC AGC ACCCTCTCC AGC AGTTC AGTGCCCACCGCGTCGTCGGAGTCGTCCAGG ATGGTCGGCTCGTGGA 

<<• LRRLGSLVRELLETGVADDSDDL ITPEHV 
101417 CTGCGACCAG<»CCGTCCGTCGCCATGTCGGCGATGAGCACCCT^ 

<AVLGDTAMDAI LVRAVGLPLQMRAAI EALS 
101509 TGC ACGCGTC CGTCGCAC AGCGCG<JCG ATGT ACTGGTGCTCTCGGC C CTGGC C A CCGTTGCTACTGGC AGCGGC C C G AC CGCGC ACCG TCGT 

<QVRGDCLAA I YQHERGQGGNSSAAARGRVTT 
1016 01 CTCGACGAGCGCCTCCAACGCGATGTCCAGCCGAGGGCGGGTACGACCGCGGGT^XACGGCGTACGGACGGACCAACGCGCCAGTCGGCTCGT 

<EVLAELAIDLRPRTRGRTVAYPRVLAGTPED 

1016 9 3 CACGATCCATGTCGCCGCTGACCTCCITCGTCCCCGACACCGGCTGAACCCGGTGGAACCCGTCGTTCTT^ 

<RDMDGSVEKTGSV 

1017 84 CCATCGGCCAGCGCGTGG<3TCAGCCCATCATCCCCACAGTCGTACGCGGCTGC 

<• GMMGVTTRPQPTLADGVRDVLLA 
101875 CATCTCGTATCCGACCTGCCCGACGTCGCAGCTACGGGCGGCGAGCACGGC 

<MEYGVQGVDCSRAALVAFSSGDS I SMLFLFG 
101967 CGTTGTCCATCTCGACCACGGTCTGCAGCACCGC ACCGCCCTCG AAGCAGCGTGCCGCTCCCTGCGTG AGGCTG ACC AGCCCGGACGCG ATC 

<NDMEVVTQLVAGGEFCRAAGQTLSVLGSA I 
102059 GCGGCGAGCTGGTCGGCCCGGTCACGCGGAAGGTCT03TGACGACGCCAGGAGCAGACCGTCCGCGGAGACGGCGACCGCGTGCGCGACACC 

<A ALQDARDRPLDRSSALLLGDASVAVAHAVG 
102151 G<X3CACCCGGTCGGCGAAGTTGGCCAGCAGCCAACCGAGATCCTGCGTAGTTGTCATCCTTGTTGCT 

< P V <• GQQEKQGSGAVP 
102242 GGCCTGAGCCAGACTGCGAGGATTGCrrcCCCACCCGGAGCrrGCCTCCGGGTTGGTCGGG^^ 

<GSGSQSSQQGGPAAEPNTPNGDPETRGRQV 
102 33 4 CCTCGATGGTATGCCGAGAGCAGACCGCGGACGCCCTCCGGCGTACGGCGCTGGA^ 
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<GRHYASLLGRVGEPTRRQVSTTPKEVGGPVL 
102426 TTGGGCCATCGGCACCCGCTTCGGCAGGCCCTTGCGGGTGGTCTCCGCCACCGGGACCTCGXSTGGCCGCCGAGGCGG^ 

<OAMPVRKPLGKRTTEAVPVETAASAARWGDr 

102 518 CCGCGGCAGTCTGCCAGGCGTGGGCCTGCGGCGTGGGCCGCCGGCCGGCGAAGCCCTCGGCGGGACCGGGGCGGGTGCC ACCGTTGGTCGGC 

<AATQWAHAQPTPRRGAFGEAPGPRTGGNTF 

102610 GACCCGTTGTCGCGCGGCATTCCTCCGGCCATCGGGGTCTCTGCCATC 

<SGNDRPMGGAMPTDAMPANGTTGPAPTQVPR 
1027 02 GCCGGTGACGTCGACCGCGGAGAACTGTTGGGTCACGGCGGCGTTCGCCGGGCTCCCGGCGCCATTGGTCGCGCGCTGGGCGACGCCGGCCG 

<GTVDVASFQQTVAANAPSGAGNTARQAVGAT 
102794 TCTCCTCCGAACCCGAGCGGCGGGTACGGAACCAGGCCGACTCGAGCTCCCGGAAGATCGGCAGCTCCATCGTCTCGTCCGCGTACCGCTGC 

<EESGSRRTRFWASELERFIPLEMTEDAYRQ 
102886 TGCCGGTTCTGG<3CCTGCACGGGCGTCGACCGCGCCGGCGTGGGCGGGGTCGTCGGCTGGGTGGCCGGCGTGCTCGGCACCTCGGTGCTCGG 

<QRNQAQVPTSRAPTPPTTPQTAPTSPVETSP 
102978 CACCCGGGGCAGCTCCGTGGTCATGTCCAGGGCTGCGGCGAGGCGCTCCGGCACCGGCGGGGTGACCGGCTCCGGCGCGGCCACCGGCGGCC 

<VRPLETTMDLAAALREPVPPTVPEPAAVPPW 
10 307 0 AGGCCGGCGGCGCCACGGGCGCCTGGGCCGACGGGACCGGCCGACTGGGCAGCGGCTGCGCCGAGTACGGCTGACCGGACACGGGCGTGCCG 

<APPAVPAQASPVPRSPLPQASYPQGSVPTG 
10 3162 AACGGCTGACCGG AGACGGGCGTGCCGAACGGCTG ACCGG AC ACCGGGGCGGCCGAGACGGGCTGACCCG AGACCGGGAAG ACGGAAACCGG 

<FPQGSVPTGFPQGSVPAASVPQGSVPFVSVP 
103254 CGGTGCGGACACCGGCGGCACCGAGACCGGTGGCGGGGTCCAGCCCCGGGCCTCCGGGCTGCTCGGCAGCTGTCGGGGGATGGCCGGCTGCT 

<PASVPPVSVPPPTWGRAEPSSPLQRPIAPQQ 

103 34 6 GGCCGCTGCTGGCCGGATCGCCGTCGCCGGACGCCCGCCGCTGCGGCAGCGGGTCGCTCGACTGGCCATTCGAGGTGCGGCCGGCGGCGGC A 

<GSSAPDGDGSARRQPLPDSSQGNSTRGAAA 
10343 8 CCGCCAGCGGTGCCACTGGCCCCGGTCAGGTCCGACC AGGCCGGCATCGACCGCATGGAACCGGTCGACGCGGGCGTGCCGTGGCCGTTGCG 

<G GATGSAGTLDSWAPMSRMSGTSAPTGHGNR 
103530 CGAGGCCGGGTCGAACGACCGGCCGCCCAGGGTTACCTGGTTGCCCGAGTGCCCCGGCCGCTGGGTGGGCGCCGGTGTCGCCGGGTTGTTGC 
<SAPDFSRGGLTVQNGSHGPRQTPAPTAPNNG 
103622 CGAAGGCCGCGAAGGCACCCAGGGCCGGGGGGGCCCCGCCGGGCTGCGAGCTGGTGAGCGAGGCGGGAGGGGCCGGCAGCGCGCCGGGCTGC 

<FAAFAGLAPPAGGPQSSTLSAPPAPLAGPQ 
103714 TGGAACCGGCCGGAGAGCGCCCGGGGCACCAGCACGGTGGTGGGC AGGGTG ACGTCGGCGACGGTCCCCCGGTCGGTGCCGGGCCGC AGCTC 
<OFRGSLARPVLVTTPLTVDAVTGRDTGPRLE 
^ 103 806 GACCTTGACCCCGTGCCGGGACGCCAACCGGGCG ACC AC AACCAGGCCCATC ATCCGGGAGACGGCC ACGTCC ACCTGCGGCGGCG AGGCGA 
' J <VKVGHRSALRAVVVLGMMRSVAVDVQPPSAL 
3 103 898 GGCGGTCGTTGAGGTCGTGTAGCTGCTCGGCGCTG ATGCCG ATGCCCCGGTCCTCGACGTAGAGGTTGGCCCGGTCGCCGACCCGGCGGGCC 
s <RDNLDHLQEASIGIGRDEVYLNARDGVRRA 
'' 4 103 9 90 TCCACCATCACCTGCGAGTCGGGCGGCGAGAAGGCGGTCGCGTTGTCGAACAGCTCGGCGACCAGGTGG ACCAGGTCGTTGACCGCGTGCGC 
H <E VMVQSDPPSFATANDFLEAVLHVLDNVAHA 

- 104082 GGCGACCTCGATGTCACGGTCGATCACCCCGAACTCGATCCGGGTGTAGTGCTCGACCTCGGACTGGGCGGCCCGCAGCACGTCGATCAGTG 
y <AVEI DRDIVGFE I RTYHEVESQAARLVDI LA 

^ 104174 CCGCCGGCTCGCGCTGCACGCGG<3TG<5AGTCGGCCCCGGCGAGCACCAGCAGGTTCTCGTCGTTGCGGCGCATCCGGGTGGCCAGGTGGTCG 
« <APERQVRTSDAGALVLLNEDNRRMRTALHD 
; ' 104266 AGCTGGAAC AGCTCGGCC AGCCGGTCCGGGTCCTCCTCGCCGCGCTCC AGCCGGTCG AGGTGGCCGATC AGCCGGTCG ACCAGG ATCTGCG A 
2 < LQ FLEALRDPDEEGRELRDLHGILRDVLIQS 
" 104358 ACGGCGGGCCAGGTTGACGAACATGGTCGCGACGGAGGCGCGCAGCGCGGCCTGCTCCGCCGCCGTCCG^ACGGCCTCCAGGTGGACCGCGT 
<RRALNVFMTAVSARLAAQEAATRVAELHVAN 
^ 104450 TGAACGCCTCGGTCACCTGGCCGAACTCGTCCTTGCTGCGCACCGGCAGCGGCTCGGCGATCTGGTTGGCCGCCTGCACCGGGGAGAGCTGG 

* <FAETVQGFEDK£RVPLPEAIQNAAQVPSLQ 

55 104542 CTCTCAAAACTGCGC^TCGCGCAG^ 

a-. <S SFQPDRLRAVAQPLGYQAI SLAGQRL-DRLS 

104634 GCGGGCCATCGACCGGGCGACCAGGTACGCGAACAGGATGGCCAGCAGCAGCATGXTCGAGCAGCAGGCCGGTCTGGAGGAACACCGTGCGXT 

* <RAMSRAVLYAFLIALLLMGLLLGTQLFVTRQ 
; 1 10472 6 GTACGTCGGAGCGGAGCGCGTCGGCCTGCTTGACCACGTTGCCGTCGAGCTTCGC 

<VDSRLADAQKVVNGDLKAEVTR I LKASAVM 
" 104818 GCCGCGTCCCACTGATCCGGCCCGAACGGCGCGTTGGCCATGCTGCCGTTGGTGTTGCCGTCGAGCCAGCCGGTGT 

<A ADWQDPGFPANAMSGNTNGDLWGTYNQAER 
104910 CCGGTCGCCGCCGGCGACGGTCTGGTCGTGCAGGTCCGACTCGTCCAGG^ 

<RDGGAVTQDHLDS EDLS AVAK F S QLAQQQGT 
105002 TGCCGCTGGCGATGTAGTCGGTGCGCAGGATGGG<XjTCAACTCGCGCTGGATCAGCGCCCGGTG 

<GSAIYDTRLI PTLERQ I LARHVVVRRVSLY 
105094 TCCTTCTCCCGGGCGACGXSCTGCCGCGGCCCGCATCCGCTCGCTCA^ 

<EKERAVAAAARMRDSLDNDGALHTASDRI SL 

105186 CAGGTCGTTGATCAGGCCCTCGTACGCCTGCATGGCGTCGAra^ 

<LDNI LGEYAQMAD I I KLKGNFVQSRTG PLDK 
10527 8 TCAGGTTCTGGTCGATCCCGTCGAGGAGGCCCTCCAGGCTGCTCGGCAGGCCGTCGACCTCCCC 

<LNQDIGDLLGELSSPLGDVEGRQQLYPVKD 

105370 TGGTCGACCCGGATGTTGACCCGGTTGTACGCCTCCTGGTACTG^ 

<QDVRINVRNYAEQYQAKAQDGSAGLLLVAST 
105462 GCGTTCGTCCTGGAGGCTGTTGACCAGGTCGCCCG AGTAGCCCACC AGATTGGCCAGGTCGCCGG AGCGGTTGGCGTTGTTCAGCGTTTCC A 

<REDQLSNVLDGSYGVLNALDGSRNANNLTEL 
105554 GGTTGTCGACGAGGCC ACTGGTGCCGACG ACGACCGTGGCG ATGGTCGGCACG ATC ATG ATGAGACCGAGCTTGGACC AG ATCGGC ATGTC 

<NDVLGSTGVVVTAITPVIMILGLKSWI PM 
105645 GCGGAGCCGGCCGACCGGCCGACGC AGTCGCG AC AGGAAGGAACCCGCCGTCTTCGGTCGTTTGCTC ACGTCACCGCCCTCGCGATC AC AGC 
105737 GTTCGCGCGTTGCCCCGGGCAACGCTCAGCGACCGACCCGGCGGGTCGGACCTCCGAGATTCCATCACGCCGTGTTCCAAAGAGAAAGCCCA 
105829 GGCTGGCCGTCGCCGGAGGTGTGATGAGATGTTGATGCAATTTGCTCGCAATCCGTCCAGCCGGAGTGACT^ 

> M D H P 

105919 C ACCGCCTCGTCCTGCTCGCCGGCCCTTCGGGCTCCGG AAAGTCGTAC ATAGCCCAACAAACCG<^TTCCTGTTCTTTGTCTGG ACG ACTT 
>HRLVLLAGPSGSGKSYIAQQTGLPVLCLDDF 

106011 CTAC AAGGATGGTGATG ACCCTACGTTACCGCGCC AAAACGGTCTTGTGGACTGGGACTC ACCCCAGTCGTGGGACGCCGGGGCGGCCGTGG 
>YKDGDDPTLPRQNGLVDWDSPQSWDAGAAV 

106103 AAACGATTGCCCGGCTGGCGCGGGACGGCAAGGCCGAAGTGCCGGTTTATGC^^ 

>E TIARLARDGKAEVPVYAIGADRRVATRT F E 

106195 GTCGCCGGATCGCCACTTTTCGTCGCCGAAGGGATTTTCGCCGCCGAGATCGTCGAGGAATC 
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>VAGS PLFVAEG I FAAEIVEECRRRGLLAGAY 
106287 CGCGCTGCGCCGGCCGCGCGGCACCACCTTTTTCCGGCGGCTCGCCCGCGACCTGGCCGAGCAGCGCAAGGCTCCCGGGATGCTGCTGCGGr 

>ALRRPRGTTFFRRLARDLAEQRKAPGMLLK 
10637 9 GCGGCCTGGCCCTGCTGCGCGCGGAGCCGGCGGTGCTGCGCCGCCAGGCGGGGCTCGGCGCTCACCCGGCCCCGGCCCGCGAGGTGCTGCGC 

>RGLALLRAEPAVLRRQAGLGAHPAPAREVLF 
1064 71 CGGGTGGCCGACCTGCTCGCCGGCCACCCGCACC ACCCCTGATCAGCCCAGCAGCTTCCCGTACGCCGGCTTG ATCACCTCGTCGATGAT 

>RVADLLAGHPHHP • -GLLKGYAPKIVEDII 
106561 GGCCAGCCGCTCGTCGAACGGGATGAACGCGCTCTTCATCGCGTTGATGGTGAACC ATTGGAGCTCCTTCCAGCCGTAGCCGAAGGCCTCCG 

<ALREDFPI FASKMANITFWQLEKWGYGFAEA 
10665 3 CCAGCAGCGCCATCTCCCGGGACATCGAGGTGCCGCTCATCAGCCGGTTGTCGGTGTTCACCGTCACCCGGAACCGCAGATCGCGCAG 

< LLAMERSMSTGSMLRNDTNVTVRFRLDRLL 

BamHI 
3 uric t ion marker 

1067 4 5 CCGATCGGGTGCTCGGCGATCGACGCCGCCGCGCCGGTCTGCACGTTCGACGACGGGCACAGCTCCAGCGGGATCCGCTTGTCCCGC ACGTA 
<G I PHEAI SAAAGTQVNSSPCLELPI RKDRVY 

10 6 8 37 CGCGGCCAGCCGGCCCAGCACGGGCGGGTCGCCGGGGGTGATGTCGTCCACGATGCGCACCCCGTGGCCGAGGCGGTCCGCGCCGCACCACT 
<AALRGLVPPDG PT I DDVIRVGHGLRDAGCWQ 

1069 2 9 GGATGGCCTGCCAGATCGACGGCAGCCCGAACGCCTCGCCGGCGTGAATGGTGAAGTGGAAGTTCTCCCGCTGCAGGTACTCG AAGGCGTCC 

< I A Q W I S PLGFAEGAHITFHFNERQLYEF AD 
107 021 AGGTGCCGGGTGGGCGGGAATCCCGCCTCCGCCCCGGCGATGTCGAAGCCCACCACGCCGGCGTCGCGGTGCCGC ACCGCC AGTTCGGCGAT 

<L HRTPPFGAEAGAI DFGVVGADRHRVALEA I 
107113 CTCCTGCGACCGGGCGGCGTGCCGCATGGCGGTGAGCAGGGTGCCGACCCGGATCGGATGGCCGGCGTCGGCGGCGAGCGCCGCGCCCTCGG 

<EQSRAAHRMATLLTGVRIPHGADAALAAGEA 
107205 CGAACCCGGCGACGACCGCCTCGACCACCTCGTCCAGGGTCAGGTCCCGCTCCAGGTGCTGCTCGGGGGCGAACCGCACCTCGGCGTAGACG 

< FGAVVAEVVEDLTLDRELHQEPAFRVEAYV 
107297 ACCCCGTCGGCGGCCAGGTCCAGCGCGCACTCCTGGGCCACCCGCCGCAGTGCGGGCGCGGTCTGCATGACCGCCACGGTGTGGGCGAACGT 

<V GDAALDLACEQAVRRLAPATQMVAVTHAFT 
107389 CTCCAGGTAGCGCTCCAGCGAGCCGGAGTTCGCCGCCGCGACGAACCAGCGGCCGAGCGCTTCCGGGTCGGTGGTGGGCAGCTCGTGGCCGA 

<ELYR.ELSGSNAAAVFWRGLAEPDTTPLEHGV 
1074 81 CCTCGGCGGCCAGCTCGACGATCGTCGCCGGCCGCAGGCCGCCGTCGAGGTGGTCGTGCAGCAGCGCCTTGGGGACCTTGACGATGTCCTCG 

< EAALEVITAPRLGGDLHDHLLAKPVKVIDE 
107573 TATGAGATTGCGACCATGCCCAGACCCTAGTAGCGACCGGCCGGGTCGCCGACGCGACCGGCTGGGAGGATGTCCAGGTGATGGACCCC 

<YSIAVM >MDP 
107 6 62 CGCATCGTCGACCGGCTGCGTTGCCCGGTCTGCGCGGAACCGCTCACCGAGGCCGCCGCCGGGACCACCCGGGCGCTGCGCTGCCCGCGCCG 

>R IVDRLRC PVCAEPLTEAAAGTTRALRC PRR 
1077 54 GCACAGCTTCGACGTGGCCCGCCAGGGGTACGTCGACCTGCTCGCCGGCCGGGCCCCGCACGTGGGCGACACCGCCGAGATGGTGGCCGCCC 

> HSFDVARQGYVDLLAGRAPHVGDTAEMVAA 
107846 GCGCCGACTTCCTCGCCGCCGGGCACTACGACACGCTCTCGGCCGCGCTCGCCGCCGCCGCGCTCGCCGCGCTGAGCCACCCGCCGGAGGCC 

>R ADFLAAGHYDTLSAALAAAALAALSHPPEA 
107938 CCCGGAGCGGACGCGTCGGCCGGCAAAGACGGGCAGGATGCCCAAGCCGGGCGGGATGCGTCCGCTGGACATC 

>PGADASAGKDGQDAQAGRDASAGHDASAGQ P 
10 803 0 GGCCGTCGGGACGTACCCGCTGGTGGTGGACGCCGGGGCGGGCACGGGCCGGCACCTCGCCGCGGTGCTGGCGGCGCTGCCCGACGCCGTGG 

>AVGTY PLVVDAGAGTGRHLAAVLAALPDAV 
108122 GCCTGGCCCTGGACGTCTCCAAGCCGGCGCTGCGCCGGGCAGCGCGGGCCCACCCGCGCGCGGCGGCGGCGCTCGCCGAC ACCTGGCGGCGG 

>GLALDVSKPALRRAARAHPRAAAALADTWRR 
108214 CTTCCGCTGGCCGACGCCAGCGTCGCCGTGCTGCTCGACGTCTTCGCCCCGCGCAACGGCGCGGAGTTCCGCCGGGTGCTCCACCCGGCCGG 

>LPLADASVAVLLDVFAPRNGAEFRRVLHPAG 
10 83 0 6 CGCGCTGCTCGTGGTCACCCCCGCCGAGGA-CrtCCTCGCCGAACTGGTCGACTCGCTCGACCTGCTGAAGGTCGACCCCGAC AAGGCGGACC 

>ALLVVTPAEDHLAELVDSLDLLKVDPDKAD 
108398 GGGTCGCCGGGAGCCTGGCCGGCCACTTCGAGCAG ACCGCCGAGAGCGTGCTGCGGGCCCGGCTGGAACTC ACCGGCCGGCAGGTGGCCACC 

>R VAGSL.AGHFEQTAESVLRARLELTGRQVAT 
108490 CTGGTCGGGATGGGACCCAGCGCCTGGCACACCGACCCGGCCACCCTCGCCGCCCGGATCGCCGCGCTACCCGAGCCGGTCCGGGTGACCCT 

>LVGMGPSAWHTDPATLAARIAALPEPVRVTL 

108582 CGCGGTACGGCTCGGOSTGTACCGCCCCCGCTGACCGGGGC^ 

>AVRLGVYRPR <.TSLDVEEWGPPE 
10867 4 CTCGTGGTAGGGCCCTCGCAGGACCACCGCCCACTCCAGCGCCC ACCGGCGCTGCCCGATCGCGTTGGCGTCG ACCAGGCCGGGCGGCGACC 

<EHYPGRLVVAWELAWRRQGIANADVLGPPSR 
108766 GCCCGTCGCGCTCCAGCTCCAGGTACGCCCAGTCGAGGCAGTAGTGCAGGTCGAGCAGGGCCGCCGCGTCGGCCGGGTGCTGGGGCGCGGCC 

<GDRELELYAWDLCYHLDLLAAADAPHQPAA 
108858 AGGATGCGGGAGCGCCACTGCTGGAAGCTCTCCCCGCCCGCGATGTGCGGCAGCCGCTCCACCAGCCGCTCGTCGACCGGCAGCGTCGGGTC 

<LIRSRWQQFSEGGAIHPLREVLREDVPLTPD 
108950 GAGCTGCTTGGCCAGGCCGAGCACCCAGGCCAGCGAGAACAGCGCGTCGT^TGCAGCACG^ 

<LQKALGLVWALSFLADHHLVFSRHDGRGGMV 
109042 CGAACTGCC ACTCCGGCGGGGTGACCAGGTCG ACCAGGTGGGAGTTGAGC AGCC AGCTCATCGCCGCCTGCGCCGGCATGCCG AAACACCGG 

< FQWEPPTVLDVLHSNLLWSMAAQAPMGFCR 
109134 GCCAGGATCACGTGCAGCACGGCGATGCGCGCCTCGATCTCGACGGTCGGCCGCAGCTCGATCTCGTCGC 

<A LIVHLVAIRAEI EVTPRLEI EDGPEWV 
109225 ACTGGCTCGGTGGCAGCGGCAGCCCCAGCCGGGACAGCTCGTCCAGGCTGGCGTCA 

109 317 GCTCAGATCCCTGTCAGTCGCATCGGCTCAGTGCCGGTCGTCCCCCTTGGCCTGGGAGGATAGCGGTTC 

1094 09 CGGGGGCGGGGGCGGTTCAGCCGATCCGCTCGATG ACCAGCGGCTGCGGGGTCGGGGCGGTCGGCGAGATCCGTACCGCCCGGACCGCCTCG 
BamHI 

10 9501 GCCAGCGCCGCCGGGATCC 
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